PROBABILITY AND ITS APPLICATION TO ANTISUBMARINE WARFARE

Introduction 

One  of  the  most  important  characteristics  of  modern  ASW  (Antisubmarine 
Warfare)  is  the  existence  of  so  many  uncertainties  and  unknowns.   Examples 
of  these  "unknowns"  abound  in  almost  every  aspect  of  this  warfare  area. 
To  a  Tactical  Coordinator  on  a  search  aircraft  the  "unknown"  may  be  the 
evasive  maneuvers  an  enemy  submarine  skipper  will  employ,  how  many  of  the 
sonobuoys  he  has  aboard  his  aircraft  are  defective,  or  which  portion  of 
a  search  region  offers  the  best  prospects  for  a  contact.   To  a  CIC  officer 
the  presence  of  a  submarine  within  detection  range  may  be  unknown  because 
of  excessive  ambient  noise;  at  the  time  a  contact  is  gained  it  may  be 
unknown  to  him  whether  the  contact  is  in  fact  a  submarine,  and  then  the 
target  location  accuracy  may  be  unknown  because  of  uncertainties  in 
underwater  sound  propagation. 

Because  of  the  existence  of  so  many  unknowns,  many  Naval  Officers  who 
work  in  ASW  are  tasked  with  making  action  recommendations  and  action 
decisions,  which  are  based  on  predictions  of  the  unknown  in  the  light  of 
all that is known. Therefore, success in the overall ASW effort is
directly  related  to  the  capability  within  the  ASW-related  officer  corps 
in  two  key  areas:   (i)   making  predictions  concerning  the  unknown,  and 
(ii)  understanding  the  concomitant  degree  of  accuracy  and  degree  of 
uncertainty  of  such  predictions,  so  that  the  best  possible  decisions  may 
then  be  made. 

Probability theory provides a logical and scientific basis for prediction
and decision making. It is not surprising, therefore, that
probabilistic concepts and the language of probability are in widespread
everyday usage in the ASW community today. Even the basic references
used in ASW, such as ComTac publications, ASW handbooks, Tactical
Reference manuals, TacAids, ASRAPS and AESPAWS, use the language and
concepts of probability theory freely.

Because  of  the  widespread  use  of  probability  in  the  tasks  assigned 
to  the  ASW-related  Officer,  his  effectiveness  should  be  enhanced  by  an 
increased  understanding  of  probability  theory  and  a  better  appreciation 
of  its  benefits  in  ASW.   This  report  constitutes  a  "tutorial  paper" 
designed  to  provide  a  means  by  which  ASW-related  officers  may  better 
understand  application  of  probability  in  modern  ASW.   The  objective  of 
this  project  has  been  to  distill  from  the  general  field  of  probability 
those  elements  that  are  vital  in  ASW  applications,  and  to  describe  these 
elements,  along  with  typical  examples  of  their  application,  in  an 
organized  and  readable  presentation  that  is  accessible  to  any  ASW-related 
Officer,  with  the  only  prerequisites  being  basic  algebra  and  geometry 
as  applied  to  tactical  situations  in  his  working  environment. 

This  paper  has  been  organized  into  the  format  of  an  academic  course, 
including  lesson  plans,  study  guides  and  suggested  textbooks.   This 
organization  was  adopted  because  it  is  expected  that  this  paper  will 
serve  as  the  basis  for  a  future  short  course  that  will  be  available  to 
the  Fleet  through  the  Continuing  Education  Office  at  the  Naval  Postgraduate 
School.   Because  of  the  possible  misunderstanding  that  may  result  from 
the  "course  format"  adopted  here,  it  is  emphasized  that  the  content  of 
this  paper  does  not  constitute  a  Naval  Postgraduate  School  course.   It  is 
intended  that  an  official  course  be  developed  from  these  notes;  this, 
however,  involves  an  evolutionary  process,  including  testing  of  the 
materials with actual students, and subsequent revisions.

The Lesson Plans provide the objectives of each unit; they indicate
what is to be learned, the depth required, and the behavioral goals. The
Study Guides serve as the basic textbook. In addition, Schaum's Outline
of Theory and Problems of Probability by S. Lipschutz (1965) and Schaum's
Outline of Theory and Problems of Statistics by M. R. Spiegel (1961),
both published by the McGraw-Hill Book Company, are very helpful; these
texts are inexpensive and easy to follow, as they contain many sample
problems with detailed solutions. Reference is made to the appropriate
sections in the Schaum's texts at the end of each chapter.

The  primary  material  presented  is  the  probability  theory  required 
for  successful  ASW  application,  and  examples  to  illustrate  the  applica- 
tion.  The  probability  theory  is  preceded  by  a  presentation  of  sufficient 
statistical  procedures  to  serve  as  "first  principles"  on  which  the 
probability  is  based. 


Lesson Plan No. S-V-1
Instructor:
Time: 65 min.
Date:
Subject Area: Probability and Its Applications to Antisubmarine Warfare
Topic: Data Reduction, Frequency Distribution
Reference: ASW Study Guide; Schaum's Outline Series, Theory
and Problems of Statistics

Title:   "Sample  Data  Reduction  and  Frequency  Distributions" 

Materials:   Transparencies  of  Table  1,  Table  2,  Table  3,  Figure  1, 
Figure  2,  and  Figure  3 

Objectives:   (1)   Acquaint  the  student  with  the  concepts  of  statistical 

population  and  samples  from  the  population;  the 
student  should  be  able  to  explain  these  concepts  by 
means  of  simple  examples. 

(2)   Discuss  the  basic  terms  required  to  describe  frequency 
distributions,  and  show  mathematically  how  they  are 
utilized.   The  student  should  be  able  to  determine  the 
frequency  distribution  from  a  given  set  of  raw  data. 

Introduction:   Instructor  should  inform  students  of  purpose  of  lesson 
and  briefly  highlight  important  points  to  be  covered. 

Presentation:   Discuss  the  following  points: 

1.  Population 

2.  Sample 

3.  Methods  of  describing  a  sample  and  estimating 
characteristics  of  the  population 

4.  Raw  data 

5.  Arrays 

6.  Classes  or  categories 

7.  Class  frequency  and  frequency  table 

8.  Class  intervals  and  class  limits 

9.  Class  boundaries 

10. Size or width of a class interval

11. Class mark

12.  General  rules  for  forming  frequency  distributions: 

a. The range

b.  Number  of  class  intervals 

c.  Number  of  observations  falling  into  each  class 
interval 

13. Histograms and frequency polygons

14.  Relative  frequency  distributions 

15.  Cumulative  frequency  distributions  or  ogives 

16.  Relative  cumulative  frequency  distributions 

17.  Types  of  frequency  curves 

a.  Symmetrical  or  bell-shaped 

b.  Skewed  to  the  right 

c.  Skewed  to  the  left 

d.  U-shaped 

e.  Bimodal 

f.  Multimodal 

Summary: Review material covered, reemphasizing key points. Ask
questions to determine effectiveness of instruction.


Lesson Plan No. S-V-2
Instructor:
Time: 115 min.
Date:
Subject Area: Probability and Its Applications to Antisubmarine Warfare
Topic: Measures of Central Tendency and Dispersion
Reference: ASW Study Guide; Schaum's Outline Series, Theory
and Problems of Statistics

Title:   "Special  Measures  of  Sample  Data—Central  Tendency  and  Dispersion" 

Materials: Transparencies of Figures 4 through 6

Objectives:   (1)   Define  central  tendency  and  its  measurement  from  sample 

data.   Illustrate  and  develop  the  sample  mean,  median, 
and  mode.   The  student  shall  be  able  to  calculate  the 
mean,  median,  and  mode  from  given  sample  data. 

(2) Define dispersion and its measurement from sample data.
Illustrate and develop the listed measures of dispersion
so that the student is capable of calculating
the range, standard deviation and variance from given
sample data.

Introduction:   Instructor  should  inform  students  of  purpose  of  lesson 
and  briefly  highlight  important  points  to  be  covered. 

Presentation:   Discuss  the  following  points: 

1.  Index  or  subscript  notation 

2.  Summation  notation 

3.  Measures  of  central  tendency 

a.  Mean 

(1)  (Arithmetic)  Mean 

(2)  Weighted  (arithmetic)  mean 

b.  Median 

c.  Mode 

4.  The  concept  of  dispersion  of  data 

5.  Specific  measures  of  dispersion 

a.  Range 

b. Standard Deviation

(1) Formulation of

(2)  Short  method  of  computing 

c.  Variance 

d.  Sigma-values  for  Normal  probability  distributions 
with  corresponding  probability  values. 

Summary: Review material covered, reemphasizing key points. Ask
questions to determine effectiveness of instruction.


Lesson Plan No. S-V-3
Instructor:
Time: 20 min.
Date:
Subject Area: Probability and Its Applications to Antisubmarine Warfare
Topic: Combinatorial Analysis
Reference: ASW Study Guide; Schaum's Outline Series, Theory
and Problems of Statistics


Title:   "Combinatorial  Analysis" 

Objective:   Develop  the  concept  of  COMBINATORIAL  ANALYSIS  by  defining  and 
illustrating,  with  a  simple  example,  the  basic  elements  of 
this  analysis.   The  student  should  be  able  to  perform 
combinatorial  analysis  on  a  given  example  by  utilizing  the 
fundamental  counting  principle,  and  calculating  permutations/ 
combinations  to  find  the  number  of  possible  outcomes  for 
the  example. 

Introduction:   Instructor  should  inform  students  of  purpose  of  lesson 
and  briefly  highlight  important  points  to  be  covered. 

Presentation:   Discuss  the  following  points: 

1.  The  fundamental  counting  principle 

2.  Factorial  n 

3.  Combinations 

4.  Permutations 

Summary:   Review  material  covered,  reemphasizing  key  points.   Ask 
questions  to  determine  effectiveness  of  instruction. 


Lesson Plan No. S-V-4
Instructor:
Time: 150 min.
Date:
Subject Area: Probability and Its Applications to Antisubmarine Warfare
Topic: Elementary Probability Theory
Reference: ASW Study Guide; Schaum's Outline Series, Theory
and Problems of Statistics; and Schaum's Outline Series,
Theory and Problems of Probability

Title: "Elementary Probability Theory"

Materials: Transparencies of Figures 5 through 12

Objectives: (1) Develop student intuition in the basic concepts of
probability, i.e., his insight into what probability is.
Develop the concepts of basic probability through the
use of the proportional areas notion of probability.
The student should be able to compute simple probabilities
given sample data and simple statement of necessary
conditions.

(2) Develop the concept of CONDITIONAL PROBABILITY as an
improved probability resulting from having previously
eliminated part of the overall sample space to be
"drawn on," through use of prior information. The
student should be able to compute conditional
probabilities utilizing any of the three listed methods.

(3) Develop the concept of STATISTICAL INDEPENDENCE, and
delineate between this and MUTUALLY EXCLUSIVE EVENTS.
Also, show the utility of statistical independence of
events in computing a simple joint probability. Given
two events, the student should be able to state
whether or not they are statistically independent.

Introduction: The instructor should inform students of purpose of lesson
and briefly highlight important points to be covered.

Presentation: Discuss the following points:

1.   Elementary  Probability  Theory 

a.   Background  and  concepts  of  probability 

(1)  "Odds"  or  ratio  concept 

(2) Proportional areas concept

b. Sample Space

c.  Probability  Experiment 

d.  Outcomes 

e.  Events 

f.  Probability  Function 

g.  Probability  Complement 

2.  Conditional  Probability 

a.  Concept  of 

b.  Methods  of  computing 

(1)  By  definition 

(2)  "Stating  and  Solving"--"Simply  writing 
it  down" 

(3)  Bayes'  Law 

3.  Statistical  Independence 

a.  Independent  Events;  Probability  of  Independent 
events 

b.  Independent  Events  vs.   Mutually  Exclusive  Events 

c.  Implications/applications  of  Statistical  Independence 

Summary:   Review  material  covered,  reemphasizing  key  points.   Ask 
questions  to  determine  effectiveness  of  instruction. 




Lesson Plan No. S-V-5
Instructor:
Time: 100 min.
Date:
Subject Area: Probability and Its Applications to Antisubmarine Warfare
Topic: Random Variables
Reference: ASW Study Guide; Schaum's Outline Series, Theory
and Problems of Probability

Title:   "Random  Variables" 

Materials:   Transparencies  of  Table  5,  Table  6  and  Figures  15  through  19 

Objectives: Introduce the concept of random variables, and develop
random variables as a device for assigning names to the outcomes
of a probability experiment. The student should be
able to demonstrate how a random variable is used by means
of examples.

Introduction:   Instructor  should  inform  students  of  purpose  of  lesson 
and  briefly  highlight  important  points  to  be  covered. 

Presentation:   Discuss  the  following  points: 

1.   Random  variable 

a.  discrete 

b.  continuous 

c.  density  function 

d.  cumulative  distribution  function  (CDF) 

e. mean

f.  variance 

g.  standard  deviation 

Summary:   Review  material  covered,  reemphasizing  key  points.   Ask 
questions  to  determine  effectiveness  of  instruction. 


Lesson Plan No. S-V-6
Instructor:
Time: 550 min.
Date:
Subject Area: Probability and Its Applications to Antisubmarine Warfare
Topic: Basic Statistical Distributions
Reference: ASW Study Guide; Schaum's Outline Series, Theory
and Problems of Statistics; and Schaum's Outline Series,
Theory and Problems of Probability

Title:   "Basic  Statistical  Distributions" 

Materials:   Transparencies  of  Table  7  and  Figures  20  through  29 

Objectives:   (1)   Present  examples  of  some  useful  statistical  distributions. 

(2)  The  student  should  be  able  to  give  a  simple  description 
of  each  of  the  distributions  presented,  and  compute 
probabilities  for  Bernoulli,  Binomial,  Uniform,  Chi- 
Square,  Exponential,  Poisson  and  Normal  examples 

(with  the  exception  of  the  Bivariate  Normal  and  Time- 
Varying  Normal,  which  are  presented  only  for  student 
appreciation  of  "state-of-the-art"  applications). 

(3) It is intended that this lesson serve to extend the
student's understanding of probability functions,
random variables, density functions, and cumulative
distribution functions.

Introduction:   Instructor  should  inform  students  of  purpose  of  lesson 
and  briefly  highlight  important  points  to  be  covered. 

Presentation:   Discuss  the  following  points: 

1. Background on the basics of specifying a statistical
distribution,  drawing  the  majority  of  this  information 
directly  from  the  previous  lesson  plan. 

2.  Bernoulli  Distribution 

3.  Binomial  Distribution 

4.  Uniform  Distribution 

5.  Normal  Distributions 

a.  Normal  Distribution 

b. Bivariate Normal Distribution

c. Time-Varying Normal Distribution

6.  Chi-Square  Distribution 

7.  Exponential  Distribution 

8.  Poisson  Distribution 

Summary:   Review  material  covered,  reemphasizing  key  points.   Ask 
questions  to  determine  effectiveness  of  instruction. 




STUDY  GUIDE 


PROBABILITY  AND  ITS  APPLICATIONS 
TO  ANTISUBMARINE  WARFARE 


INTRODUCTION 

Probability  theory  originated  in  the  early  seventeenth  century  with 
investigations  of  various  games  of  chance.   Since  then  many  scholars 
have  made  contributions  to  the  theory.   Probability  theory  was  finally 
axiomatized in the 1920s and 1930s. This axiomatic
development,  called  Modern  Probability  Theory,  provided  for  a  precise 
formulation  of  probabilistic  concepts,  which  enabled  the  theory  to  be 
placed  on  a  firm  mathematical  foundation. 

The  importance  of  probability  has  increased  significantly  in  the 
past  twenty  years.   Today  the  notions  of  probability,  and  the  closely 
related  subject  statistics,  appear  in  almost  every  discipline.   Probability 
is  used  routinely  by  many  Naval  Officers  in  the  line  of  duty,  and  plays 
an  important  role  in  Antisubmarine  Warfare,  especially  in  the  area  of 
search  and  detection. 

This  study  guide  is  designed  for  an  introductory  course  in  probability 
and  applications  to  antisubmarine  warfare,  with  basic  algebra  as  the  only 
prerequisite.   The  guide  may  serve  as  a  text  for  such  a  course,  as  lesson 
plans  are  also  provided,  or  as  a  supplement.   In  addition,  since  this 
study  guide  is  complete  and  self-contained,  it  could  be  used  for  self- 
study. 

Chapters I, II, and III of the study guide cover sample data reduction,
frequency distributions, associated measures of central tendency and
dispersion, and combinatorial analysis. This leads naturally to
elementary probability theory, conditional probability, statistical
independence and applications, in Chapter IV. Chapter V covers random
variables. Discrete and continuous random variables are covered, with
emphasis on the use of tables to determine numerical probabilities.
Chapter VI covers the Bernoulli, Binomial, Uniform, Normal, Chi-square,
Exponential and Poisson distributions, which are commonly used in
antisubmarine warfare applications. Since developing understanding of
the applications of these distributions to ASW is the primary "payoff"
for this course, Chapter VI is significantly longer and has more
applications than the first five chapters.

Each  chapter  has  clear  statements  of  pertinent  definitions  and 
principles,  together  with  illustrations  and  other  descriptive  material. 
This  is  followed  by  solved  problems,  which  serve  to  illustrate  and 
amplify  the  theory,  bring  into  sharp  focus  those  fine  points  without 
which the student feels himself on unsafe ground, and provide the
repetition of basic principles so vital to effective learning.


CHAPTER  I:   FREQUENCY  DISTRIBUTION 

POPULATION  AND  SAMPLE 

In  collecting  data  on  characteristics  of  a  group  of  individuals 
or  objects,  such  as  heights  of  students  in  a  university,  or  the  number 
of  defective  and  non-defective  sonobuoys  produced  in  a  factory  on  a 
given  day,  it  is  often  impossible  or  impractical  to  observe  the  entire 
group,  especially  if  it  is  large.   Instead  of  examining  the  entire 
group,  called  the  population,  one  examines  a  small  part  of  the  group 
called  a  sample.   If  a  sample  is  representative  of  a  population,  important 
conclusions  about  the  population  can  be  reached  from  analysis  of  the 
sample.   Inferences  based  on  sample  data  are  not  absolutely  certain 
because  in  practice  it  is  usually  not  known  beforehand  whether  the 
sample  is  indeed  representative  of  the  population,  so  the  term  probability 
is  used  in  describing  conclusions.   The  term  statistics  is  used  in 
connection  with  conditions  under  which  such  an  inference  is  made. 

RAW  DATA 

Raw  data  are  collected  data  which  have  not  been  organized  numerically. 
An  example  is  the  set  of  heights  of  100  students  obtained  from  Naval 
Academy  records.   The  heights  of  the  100  students,  with  repetitions 
included,  are  raw  data  which  constitute  a  sample,  and  the  population 
from  which  the  sample  is  taken  is  the  set  of  heights  of  the  students 
from  the  entire  listing  of  the  Naval  Academy  records. 


ARRAYS 

An  array  is  an  arrangement  of  raw  numerical  data  in  ascending  or 
descending  order  of  magnitude. 
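
As a minimal sketch of forming an array, the Python fragment below sorts a short list of raw heights into ascending and descending order. The ten values are hypothetical, chosen only for illustration; they are not taken from the Naval Academy records.

```python
# Hypothetical raw data: heights in inches, in the order they were collected.
raw_heights = [67, 72, 64, 69, 66, 61, 70, 68, 66, 73]

# An array is the same data arranged in order of magnitude.
ascending = sorted(raw_heights)
descending = sorted(raw_heights, reverse=True)

print(ascending)
print(descending)
```

Repeated values (here, 66) are kept in the array; sorting only rearranges the observations.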

CLASSES  OR  CATEGORIES,  CLASS  FREQUENCY  AND  FREQUENCY  TABLE 


In  order  to  comprehend  what  masses  of  collected  raw  data  really 
mean,  and  to  draw  conclusions  from  these  data,  it  is  useful  to  distribute 
the  data  into  classes  or  categories.   The  number  of  individual  data 
points  belonging  to  a  class  is  called  the  class  frequency.   A  tabular 
arrangement  of  data  by  classes,  together  with  the  corresponding  class 
frequencies,  is  called  a  frequency  table.   Table  1  is  the  frequency  table 
of  the  heights  (recorded  to  the  nearest  inch)  of  the  100  Naval  Academy 
students. 


Height (inches)     Number of Students
(class)             (class frequency)

60 - 62                    5
63 - 65                   18
66 - 68                   42
69 - 71                   27
72 - 74                    8

Total                    100

Table 1: Heights of 100 Naval Academy Students
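
The tallying step behind a frequency table can be sketched in Python as follows. The function name frequency_table and the ten raw heights are assumptions made for illustration; only the class intervals are taken from Table 1.

```python
def frequency_table(data, classes):
    """Count how many observations fall in each class.

    `classes` is a list of (lower_limit, upper_limit) pairs. Both limits
    are inclusive, which suits data recorded to the nearest inch.
    """
    counts = {c: 0 for c in classes}
    for x in data:
        for lower, upper in classes:
            if lower <= x <= upper:
                counts[(lower, upper)] += 1
                break  # each observation belongs to exactly one class
    return counts


# Class intervals of Table 1.
classes = [(60, 62), (63, 65), (66, 68), (69, 71), (72, 74)]

# Hypothetical raw data (not the actual Naval Academy sample).
raw_heights = [67, 72, 64, 69, 66, 61, 70, 68, 66, 73]

table = frequency_table(raw_heights, classes)
for (lower, upper), freq in table.items():
    print(f"{lower} - {upper}: {freq}")
```

The class frequencies always sum to the number of observations, which is a quick check that no data point was missed or counted twice.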


CLASS  INTERVALS  AND  CLASS  LIMITS 

The  symbol  defining  a  class,  such  as  60  -  62  in  Table  1,  is  called  a 
class  interval.   The  end  numbers,  60  and  62,  are  called  class  limits; 
the  smaller  number  60  is  the  lower  class  limit  and  the  larger  number  62 
is  the  upper  class  limit.   A  class  interval  which  has  no  upper,  or  no 
lower,  or  neither  an  upper  nor  a  lower  class  limit  is  called  an  open 
class  interval.   For  example,  an  open  class  interval  is  appropriate  for 
describing  torpedoes  with  a  range  of  "10,000  yards  or  greater." 

CLASS  BOUNDARIES 

In  Table  1  the  heights  are  recorded  to  the  nearest  inch,  and  the  class 
interval 60 - 62 includes all measurements from 59.5000 to 62.5000
inches.   These  numbers,  indicated  briefly  by  the  exact  numbers  59.5  and 
62.5,  are  called  class  boundaries;  the  smaller  number  59.5  is  the  lower 
class  boundary  and  the  larger  number  62.5  is  the  upper  class  boundary. 

SIZE  OR  WIDTH  OF  A  CLASS  INTERVAL 

The  size  or  width  of  a  class  interval  is  the  difference  between  the 
lower  and  upper  class  boundaries  and  is  referred  to  as  the  class  width, 
class  size,  or  class  length.   For  the  data  in  Table  1,  for  example,   the 
class  width  is  62.5  -  59.5  =  65.5  -  62.5  =  74.5  -  71.5  =  3. 

CLASS  MARK 

The  class  mark  is  the  midpoint  of  the  class  interval;  it  is  obtained 
by  adding  the  lower  and  upper  class  limits  and  dividing  by  two.   Thus  the 
class  mark  of  the  interval  60  -  62  is  (60  +  62)/2  =  61.   The  class  mark 
is  also  called  the  class  midpoint. 
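
The class boundaries, class width, and class mark defined above can be computed from the class limits. The sketch below is illustrative only; the function names and the `unit` parameter (the precision to which the data are recorded, one inch in Table 1) are assumptions introduced here.

```python
def class_boundaries(lower_limit, upper_limit, unit=1):
    """Boundaries lie half a recording unit outside the class limits."""
    half = unit / 2
    return lower_limit - half, upper_limit + half


def class_width(lower_limit, upper_limit, unit=1):
    """Width is the difference between the upper and lower boundaries."""
    lower_boundary, upper_boundary = class_boundaries(lower_limit, upper_limit, unit)
    return upper_boundary - lower_boundary


def class_mark(lower_limit, upper_limit):
    """The class mark is the midpoint of the class limits."""
    return (lower_limit + upper_limit) / 2


# First class of Table 1: 60 - 62 inches.
print(class_boundaries(60, 62))  # (59.5, 62.5)
print(class_width(60, 62))       # 3.0
print(class_mark(60, 62))        # 61.0
```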


GENERAL  RULES  FOR  FORMING  FREQUENCY  DISTRIBUTIONS 

1. Determine the largest and smallest numbers in the raw data
and thus find the range (the length of the interval from
the smallest to the largest number).

2. Divide the range into a convenient number of class intervals
having the same size. The number of class intervals is
usually taken between 5 and 20, depending on the data.
Whenever possible, class intervals are also chosen so that the
class marks or midpoints coincide with actually observed data.
However, class boundaries usually do not coincide with
actually observed data.

3. Determine the number of observations falling into each class
interval; i.e., find the class frequencies.

As an example of a frequency distribution, consider the wages in
dollars per week paid to enlisted personnel aboard a particular
destroyer. Suppose the wages ranged from $50.00 to $120.00 per week
with 8 personnel receiving wages between $50.00 and $60.00,
10 between $60.00 and $70.00, 16 between $70.00 and $80.00,
14 between $80.00 and $90.00, 10 between $90.00 and $100.00,
5 between $100.00 and $110.00, and 2 between $110.00 and $120.00.
A possible formulation of the corresponding frequency distribution
is presented in Table 2, with the range from $50.00 to $119.99
divided into 7 class intervals, each ten dollars wide, and the
total number of personnel listed for each class interval.


Wages (dollars)          Number of Enlisted Personnel
(classes)                (class frequencies)

$ 50.00 - $ 59.99                  8
$ 60.00 - $ 69.99                 10
$ 70.00 - $ 79.99                 16
$ 80.00 - $ 89.99                 14
$ 90.00 - $ 99.99                 10
$100.00 - $109.99                  5
$110.00 - $119.99                  2

Total                             65

Table 2: Frequency Distribution of the Weekly Wages
in Dollars of 65 Enlisted Personnel Aboard
a Destroyer
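
Rules 1 and 2 above can be sketched in Python for the wage example. This is one possible approach, not a prescribed procedure: the function name class_limits is hypothetical, and the wages are held as whole cents (an assumption made here) so that the arithmetic is exact.

```python
def class_limits(smallest, largest, width, unit=1):
    """Return (lower_limit, upper_limit) pairs of equal-width classes.

    All arguments are integers in the smallest recording unit (cents here).
    """
    span = largest - smallest + unit      # length of the range to be covered
    n_classes = -(-span // width)         # integer ceiling division
    limits = []
    for i in range(n_classes):
        lower = smallest + i * width
        upper = lower + width - unit      # last recordable value in the class
        limits.append((lower, upper))
    return limits


# Range of Table 2: $50.00 to $119.99, in ten-dollar classes.
limits = class_limits(smallest=5000, largest=11999, width=1000)

# Class frequencies as listed in Table 2.
frequencies = [8, 10, 16, 14, 10, 5, 2]

for (lower, upper), freq in zip(limits, frequencies):
    print(f"${lower / 100:7.2f} - ${upper / 100:7.2f}   {freq:3d}")
print("Total", sum(frequencies))
```

The result is seven classes, which falls within the usual guideline of 5 to 20 class intervals.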

With reference to Table 2 determine:

1. The lower limit of the sixth class. Ans. $100.00

2. The upper limit of the fourth class. Ans. $89.99

3. The class mark of the third class.

Class mark of 3rd class = 1/2($70.00 + $79.99) = $74.995
(For most practical purposes this is rounded to $75.00.)

4. The class boundaries of the fifth class.

Lower class boundary of 5th class = 1/2($90.00 + $89.99) = $89.995
Upper class boundary of 5th class = 1/2($99.99 + $100.00) = $99.995

5. The size of the fifth class interval.

Size of 5th class interval = upper boundary of 5th class - lower
boundary of 5th class = $99.995 - $89.995 = $10.00


HISTOGRAMS  AND   FREQUENCY   POLYGONS 


Histograms and frequency polygons are graphical representations of
frequency distributions.

1. A histogram consists of a set of rectangles having:

(a) Bases on a horizontal axis (the X axis) with centers
at the class marks and lengths equal to the class sizes.

(b)  Areas  proportional  to  class  frequencies. 

2.  A  frequency  polygon  is  a  line  graph  of  class  frequency  plotted 
against  class  mark.   It  can  be  obtained  by  connecting  midpoints 
of  the  tops  of  the  rectangles  in  the  histogram. 

Figure  1  shows  a  histogram  and  frequency  polygon  developed  from  the  data 
listed  in  Table  1. 
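
The geometry of the histogram and frequency polygon can be sketched in Python without any plotting library. The function names and the `unit` parameter (the recording precision, one inch) are assumptions introduced for illustration; the data are those of Table 1.

```python
# Table 1 as ((lower_limit, upper_limit), class_frequency) pairs.
table_1 = [((60, 62), 5), ((63, 65), 18), ((66, 68), 42),
           ((69, 71), 27), ((72, 74), 8)]


def histogram_rectangles(table, unit=1):
    """Return (left_edge, right_edge, height) for each rectangle.

    Each base runs between the class boundaries, so it is centered on the
    class mark. With equal class widths, a height equal to the class
    frequency makes the area proportional to the frequency.
    """
    half = unit / 2
    return [(lower - half, upper + half, freq) for (lower, upper), freq in table]


def polygon_vertices(table):
    """Return (class_mark, frequency) points of the frequency polygon."""
    return [((lower + upper) / 2, freq) for (lower, upper), freq in table]


print(histogram_rectangles(table_1))
print(polygon_vertices(table_1))
```

Each polygon vertex sits at the midpoint of the top of the corresponding rectangle, as described in item 2 above.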

RELATIVE  FREQUENCY  DISTRIBUTIONS 


The  relative  frequency  of  a  class  is  the  frequency  of  the  class 
divided  by  the  total  frequency  of  all  classes,  and  is  generally 
expressed  as  a  percentage.   For  example,  the  relative  frequency  of  the 
class  66  -  68  in  Table  1  is  42/100  =  42%.   The  sum  of  the  relative 
frequencies  of  all  classes  is  1,  or  100%.   If  the  frequencies  in  Figure  1 
are replaced by corresponding relative frequencies, the figure is called
a relative frequency distribution.

Figure 1: Histogram and Frequency Polygon of Data
Listed in Table 1
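
The relative frequencies for Table 1 can be checked with a few lines of Python. This is a minimal sketch; the dictionary layout and variable names are choices made here for illustration.

```python
# Class frequencies from Table 1 (heights in inches).
frequencies = {"60-62": 5, "63-65": 18, "66-68": 42, "69-71": 27, "72-74": 8}

total = sum(frequencies.values())

# Relative frequency = class frequency / total frequency of all classes.
relative = {cls: freq / total for cls, freq in frequencies.items()}

for cls, rel in relative.items():
    print(f"{cls}: {rel:.0%}")
```

The printed percentages add to 100%, in agreement with the statement that the relative frequencies of all classes sum to 1.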


CUMULATIVE  FREQUENCY  DISTRIBUTIONS  AND  OGIVES 

The  total  frequency  of  all  values  less  than  the  upper  class  boundary 
of  a  given  class  interval  is  called  the  cumulative  frequency  up  to  and 
including  that  class  interval.   For  example,  the  cumulative  frequency 
up  to  and  including  the  class  interval  66  -  68  in  Table  1  is  5  +  18  +  42 
=  65,  which  means  that  65  students  have  heights  less  than  68.5  inches. 
A  table  presenting  such  cumulative  frequencies  is  called  a  cumulative 
frequency  distribution,  cumulative  frequency  table,  or  briefly  a 
cumulative  distribution,  and  is  shown  in  Table  3  for  the  Naval  Academy 
student height distribution. A graph showing the cumulative frequency
less than any upper class boundary plotted against the upper class
boundary is called a cumulative frequency polygon or ogive; an example
is shown in Figure 2 for the Naval Academy student height distribution.


Height (inches)       Number of Students

less than 59.5               0
less than 62.5               5
less than 65.5              23
less than 68.5              65
less than 71.5              92
less than 74.5             100

Table 3: Cumulative Frequency Distribution
of the Naval Academy Student Height
Distribution of Table 1


Figure 2: Ogive for the Data of Table 3


RELATIVE  CUMULATIVE  FREQUENCY  DISTRIBUTIONS 

The  relative  cumulative  frequency  or  percentage  cumulative  frequency 
is  the  cumulative  frequency  divided  by  the  total  frequency.   For  example, 
the  relative  cumulative  frequency  of  heights  of  Naval  Academy  students 
with heights less than 68.5 inches is 65/100 = 65%, which indicates that
65%  of  the  students  have  heights  less  than  68.5  inches. 
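
The cumulative and relative cumulative frequencies for the Naval Academy height data can be reproduced with a running sum. The sketch below uses Python's standard itertools.accumulate; the variable names are illustrative assumptions.

```python
from itertools import accumulate

# Upper class boundaries and class frequencies from Table 1.
upper_boundaries = [62.5, 65.5, 68.5, 71.5, 74.5]
frequencies = [5, 18, 42, 27, 8]

# Cumulative frequency: running total of the class frequencies.
cumulative = list(accumulate(frequencies))

# Relative cumulative frequency: cumulative frequency / total frequency.
total = cumulative[-1]
relative_cumulative = [c / total for c in cumulative]

for boundary, c, rc in zip(upper_boundaries, cumulative, relative_cumulative):
    print(f"less than {boundary}: {c:3d}  ({rc:.0%})")
```

Plotting the cumulative values against the upper class boundaries, starting from 0 at 59.5 inches, gives the points of the ogive.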

TYPES  OF  FREQUENCY  CURVES 


Many times relative frequency curves are smoothed (that is, a smooth
curve is "fitted" to a relative frequency polygon) and are used to
represent situations arising in ASW applications, such as probability of
detection. (These probability distributions are developed in Chapters IV
through VI.)

Some of the more common frequency curves arising in ASW applications
take on the characteristic shapes indicated in Figure 3.


Figure 3: Types of Frequency Curves
(Panels: Symmetrical or Bell-Shaped; Skewed to the Right (Positive
Skewness); Skewed to the Left (Negative Skewness); U-Shaped; Bimodal;
Multimodal)

The  symmetrical  frequency  curves  are  characterized  by  the  fact 
that  observations  equidistant  from  the  central  maximum  have  the 
same  frequency.   An  important  example  is  the  Normal  curve  shown 
in  Chapter  VI. 

In  the  skewed  frequency  curves  the  tail  of  the  curve  to  one  side 
of  the  central  maximum  is  longer  than  that  to  the  other.   If  the 
longer  tail  occurs  to  the  right  the  curve  is  said  to  be  skewed 
to  the  right  or  to  have  positive  skewness,  while  if  the  reverse 
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is  true  the  curve  is  said  to  be  skewed  to  the  left  or  to  have 
negative  skewness. 

3.  A  U-shaped  frequency  curve  has  a  maxima  at  both  ends. 

4.  A  bimodal  frequency  curve  has  two  maxima. 

5.  A  multimodal  frequency  curve  has  more  than  two  maxima. 

Supplementary solved problems covering arrays, frequency distributions,
histograms, frequency polygons, cumulative frequency distributions, ogives,
and frequency curves are presented at the end of Chapter 2 of Schaum's
Outline Series, Theory and Problems of Statistics. These problems should
be consulted if a greater review of the material is desired.

CHAPTER II: SPECIAL MEASURES OF SAMPLE DATA:
CENTRAL TENDENCY AND DISPERSION


MEASURES 

Although data relating to observations of events (for example, the
number of submarine detections in a given region during a specific
period of time) are important in developing methods for predicting
similar events in the future, the data in the raw state are usually not
directly suitable for making these predictions. To facilitate further
examination, the data must be reduced to a format that is still more
manageable than the frequency histogram of Chapter I, and from which the
pattern of events on which the data are based can be determined. Such a
representation may consist of "measures" of the data. Through the use of
these measures, an entire data set can be summarized effectively by two
or three numbers. These measures are therefore important and useful
devices by which sets of sample data can be quickly described.

Two particularly useful measures of sample data are those which
indicate or describe a relative central value of the sample data
(central tendency), and the degree to which the data are spread out
about a central value (dispersion). A measure of central tendency can be
utilized to indicate a "common quantity" which can be used to describe
roughly the entire sample (and possibly even the entire population).
A measure of dispersion can be used as an indicator of the relative
"error" or "spread" that exists when a measure of central tendency is
used to describe the sample or the population.

A digression will be made at this point to introduce two useful
forms of notation: index (or subscript) notation and summation notation.

SUBSCRIPT  NOTATION 

When it is necessary to describe a variable, X, and the values it
can assume, and also to distinguish between individual members of the
group (these members are specific values which the variable X can assume:
the 1st x, the 2nd x, etc.), subscript notation is a concise, useful means
of describing specific values of the variable. To describe the values
of the variable X with subscript notation, the first value (1st x) of
the group is denoted by $x_1$, the second value by $x_2$, and so forth to
the last, say the nth, value, which is denoted by $x_n$. The entire group
of values of the variable, X, can then be denoted by

$$X = \{x_1, x_2, \ldots, x_n\}$$

where the three dots indicate that the middle values are included in the
set but, for brevity, are not explicitly written down.

SUMMATION  NOTATION 

Summation notation is a concise and useful means of denoting the sum
of all or part of the individual observed values of a variable X. The
notation used to denote the sum

$$x_1 + x_2 + x_3 + \cdots + x_n$$

is

$$\sum_{i=1}^{n} x_i .$$

The capital Greek letter Σ (sigma) denotes that a sum is to be taken over
the observed values of the variable $x_i$ which follow it, and the values
of $x_i$ that are to be summed are those with subscripts from i = 1 through
i = n. For example, to sum the third through the fifth observed values
of X, the summation is written as

$$\sum_{i=3}^{5} x_i .$$

That is,

$$\sum_{i=3}^{5} x_i = x_3 + x_4 + x_5 .$$

This example shows that the lower limit on i need not be 1. Further, if
the values X can assume (say, a total of six values) are $x_1 = 0$, $x_2 = 3$,
$x_3 = 7$, $x_4 = 2$, $x_5 = 1$, and $x_6 = 9$, then

$$\sum_{i=3}^{5} x_i = 7 + 2 + 1 = 10 .$$
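
The six-value example above can be checked with a short Python sketch. The helper name `summation` is an assumption made for illustration; the only subtlety is that Python lists start at position 0, so $x_i$ is stored at position i - 1.

```python
# Observed values x_1 .. x_6 from the example in the text.
# Python lists are 0-indexed, so x_i is stored at position i - 1.
x = [0, 3, 7, 2, 1, 9]

def summation(values, lower, upper):
    """Return the sum of x_i for i = lower, ..., upper (1-based, inclusive)."""
    return sum(values[lower - 1:upper])

total_3_to_5 = summation(x, 3, 5)      # x_3 + x_4 + x_5
total_all = summation(x, 1, len(x))    # full sum over i = 1, ..., n
print(total_3_to_5, total_all)
```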

The following example shows how this notation can be used in an ASW
setting. Suppose there are a total of six regions, each of which can be
monitored by a single sensor, where the number of possible targets in
each region is known for given periods of time (from intelligence). It
is important for the search planner to know the number of potential
detection opportunities available from the monitored regions so that he
can ensure that sufficient localizing/tracking platforms (ships, aircraft,
submarines) are on standby for employment upon initial detection. As
the decision to monitor regions three through five has been previously
made, he wishes to specify the number of detection opportunities available
in these regions on a given day in a standard format, regardless of the
particular values entered in the format for a specific period; this is
because he wishes to include this calculation in a larger computer
program. The notation

$$\sum_{i=3}^{5} x_i$$

is a convenient format for this purpose. In any particular run of the
program, numerical values for $x_3$, $x_4$, and $x_5$ must be entered.
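
As a rough illustration of how such a standard format might look inside a larger program, the sketch below fixes the index range (regions 3 through 5) and leaves the values to be supplied on each run. The function name and the target counts are hypothetical and are not taken from the text.

```python
def detection_opportunities(targets_by_region, first=3, last=5):
    """Sum the possible targets x_i over monitored regions first..last (inclusive)."""
    return sum(targets_by_region[i] for i in range(first, last + 1))

# Hypothetical intelligence estimates for regions 1 through 6 on one day.
todays_estimates = {1: 2, 2: 0, 3: 4, 4: 1, 5: 3, 6: 5}

opportunities = detection_opportunities(todays_estimates)
print(opportunities)
```

Only the values for regions 3, 4, and 5 affect the result, which mirrors the statement that $x_3$, $x_4$, and $x_5$ must be entered for any particular run.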

An important property of summation is that if a constant value, k,
is summed over a range of N indices, then the summation of this
quantity is equivalent to multiplying the constant k by N. For example,

$$\sum_{i=2}^{5} k = k + k + k + k = 4k ,$$

since the range contains N = (5 - 2) + 1 = 4 terms.

MEAN, MEDIAN, AND MODE

Three  particularly  useful  measures  of  central  tendency  are  the  mean, 
the  median,  and  the  mode. 

The mean, which corresponds to the commonly used notion of the arithmetic
average, is computed by adding the observed values of sample data and
dividing this sum by the total number (N) of data points in the sample.
The mean can be expressed in summation notation as

$$\bar{X} = \frac{1}{N} \sum_{i=1}^{N} x_i ,$$

where N is the total number of observed values summed, and $\bar{X}$ is read as
"the mean value of X." (Note that the upper limit on the summation index
equals the reciprocal of the constant in front of the summation.) For
example, the mean of the observed values $x_1 = 2$, $x_2 = 5$, and $x_3 = 1$ of X is

$$\bar{X} = \frac{1}{3} \sum_{i=1}^{3} x_i = \frac{1}{3}(2 + 5 + 1) = \frac{8}{3} .$$

The utility of this value (8/3) for $\bar{X}$ is that it can be taken, in many
cases, to state a "middle value" of the observed data, and is often used
as the best single number estimate of the values of X.
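
The three-value example can be reproduced with exact fractions, so that the result appears as 8/3 rather than as a rounded decimal. This is a minimal sketch; the function name `mean` is chosen here for illustration.

```python
from fractions import Fraction

def mean(values):
    """Return (1/N) * sum of x_i as an exact fraction (integer inputs assumed)."""
    return Fraction(sum(values), len(values))

x = [2, 5, 1]          # observed values x_1, x_2, x_3 from the text
x_bar = mean(x)
print(x_bar)
```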

For  example,  suppose  it  is  necessary  to  search  for  an  enemy  submarine 
which  is  currently  maintaining  a  "known"  missile  launching  station, 
and  the  following  conditions  exist: 

1.  the  "known"  missile  launch  station  is  actually  a  large  ocean 
region; 

2.  the  area  that  can  be  searched  effectively  is  one-half  the  size  of 
the  missile  station; 

3.  the  positions  of  previous  enemy  submarine  detections  within  the 
missile  launch  area  are  available  to  the  search  planner. 

The above information is depicted in Figure 4a below, where the dots
represent the previous detection positions.

Figure 4: Submarine Search Information:
a. Positions of Previous Detections;
b. Mean Search Area
(figure labels: Missile Launch Station; Unsearched Area; Mean Position)

It  can  be  seen  from  Figure  4a  that  the  positions  tend  to  cluster  about  a 
certain  small  enclosed  portion  of  the  launch  region;  therefore,  it  seems 
reasonable  to  search  about  this  small  area  in  order  to  increase  efficiency. 
The  center  of  this  small  region  is  the  mean  position  of  previous  enemy 
submarine  contacts.   Therefore,  to  obtain  the  position  about  which  the 
search  should  be  centered,  the  mean  of  the  previous  positions  is  calculated. 
The  search  is  subsequently  centered  about  this  mean  position,  as  seen  in 
Figure  4b. 
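
The mean position can be sketched as a coordinate-by-coordinate mean. The example below assumes, purely for illustration, that each previous detection is recorded as a pair of distances (in n.m.) east and north of a reference point on a flat grid; the positions themselves are hypothetical.

```python
def mean_position(positions):
    """Return the coordinate-wise mean (x-bar, y-bar) of a list of (x, y) points."""
    n = len(positions)
    mean_x = sum(x for x, _ in positions) / n
    mean_y = sum(y for _, y in positions) / n
    return mean_x, mean_y

# Hypothetical previous detections, in n.m. east and north of a reference point.
detections = [(10.0, 20.0), (12.0, 22.0), (11.0, 18.0), (15.0, 20.0)]

center = mean_position(detections)
print(center)
```

The search area of Figure 4b would then be centered on `center`.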

A special form of the mean is the weighted (arithmetic) mean. This
form allows one to account for differing "levels of importance" of the
individual sample observations. For example, if in a three element sample,
$x_1$ were considered to be twice as important as $x_2$, and $x_3$ four times as
important as $x_2$, the weighted mean of the sample would be computed as

$$\bar{X} = \frac{m_1 x_1 + m_2 x_2 + m_3 x_3}{m_1 + m_2 + m_3} = \frac{2x_1 + 1x_2 + 4x_3}{2 + 1 + 4} ,$$

where $m_i$ is the relative weight, or level of importance, assigned
to the ith element.
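
A minimal sketch of the weighted mean follows, using the weights 2, 1, and 4 from the text. The sample values are hypothetical, since the text leaves $x_1$, $x_2$, and $x_3$ unspecified.

```python
from fractions import Fraction

def weighted_mean(values, weights):
    """Return sum(m_i * x_i) / sum(m_i) as an exact fraction (integer inputs assumed)."""
    numerator = sum(m * x for m, x in zip(weights, values))
    return Fraction(numerator, sum(weights))

sample = [2, 5, 1]     # hypothetical values for x_1, x_2, x_3
weights = [2, 1, 4]    # m_1, m_2, m_3 from the text

result = weighted_mean(sample, weights)
print(result)
```

Setting every weight to 1 reduces the weighted mean to the ordinary mean.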

The median value is the value which divides the arrayed (ordered)
sample into two equal parts; there are exactly as many observations
greater than the median value as there are less. When the number of
observations is even, the median is exactly halfway between the two
"center" observations, but for an odd number of observations the median
value is the center observation.

For example, the sample

$$x_1 = 2, \quad x_2 = 5, \quad x_3 = 1, \quad x_4 = 7 ,$$

when arrayed, is 1, 2, 5, 7 and has as its median

$$\frac{2 + 5}{2} = 3.5 ,$$

and we note that this is not one of the values actually observed.
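
The even and odd cases can both be checked with Python's standard `statistics` module. The odd-sized sample below is an added illustration, not part of the text.

```python
import statistics

even_sample = [2, 5, 1, 7]   # sample from the text; arrayed: 1, 2, 5, 7
odd_sample = [2, 5, 1]       # illustrative odd-sized sample; arrayed: 1, 2, 5

median_even = statistics.median(even_sample)   # halfway between 2 and 5
median_odd = statistics.median(odd_sample)     # the center observation
print(median_even, median_odd)
```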

The mode or modal value of the sample is the most frequently occurring
value (or values) in the sample. If more than one value occurs at this
same maximum frequency (of occurrence), then the sample is said to be
multimodal; such a sample would have a frequency curve with more than
one peak. For example, for the sample $x_1 = 2$, $x_2 = 5$, $x_3 = 2$, and
$x_4 = 3$, the mode is 2, since it occurs twice in the sample, while all
other observed values appear once. Although the mode is easy to ascertain
from a sample, it is probably the least useful of the measures of central
tendency treated here.

It is seen that each of the three measures of central tendency
mentioned can be used as an estimate of (or statistic which describes)
the sampled population; however, the mean is probably the most useful of
the three estimators, because it takes every observed value into account
and lends itself to further computation (for example, of the standard
deviation treated below). It should be noted that the mean is influenced
by extreme sample values to a greater extent than the median or the
mode, so a sample containing a few extreme values should be examined
before its mean is relied upon. For most practical applications, the
mean of a sample is taken to reflect the central tendency of the parent
population.
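
The mode example given earlier in this section can be checked with `statistics.multimode`, which returns every value that occurs at the maximum frequency. The second sample is hypothetical and is included only to show a case with more than one modal value.

```python
import statistics

sample = [2, 5, 2, 3]            # sample from the text
two_mode_sample = [1, 1, 4, 4, 6]  # hypothetical sample with two modal values

modes = statistics.multimode(sample)
modes_two = statistics.multimode(two_mode_sample)
print(modes, modes_two)
```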

RANGE,  STANDARD  DEVIATION  AND  VARIANCE --MEASURES  OF  DISPERSION 

A useful statistical indicator for sample data and its parent
population is the spread of the sample points (usually expressed relative
to, or about, the mean). Such an indicator is called a measure of
dispersion (spread).

Three  particularly  useful  measures  of  dispersion  are  the  range,  the 
standard  deviation,  and  the  variance. 

The  range  is  obtained  by  taking  the  difference  between  the  largest 
and  the  smallest  observed  value  in  the  sample.   While  the  range  of 
sampled  data  is  easy  to  compute,  it  is  not  as  accurate  an  indicator  as 
the  other  two  mentioned.   This  is  due  to  the  large  degree  to  which  one  or 
two  sample  values  (a  low  frequency  of  occurrence)  at  relatively  extreme 
distances  from  the  center  of  the  greatest  portion  of  the  sample  can  bias 
the  estimate  of  the  actual  spread  (or  dispersion)  of  the  sampled 
population. 

The standard deviation $\sigma$ (lower case Greek sigma) is obtained by
first computing the sample mean, and then using the formula

$$\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{X})^2} .$$

Since the standard deviation tends to weight each observed value by its
frequency of occurrence, it is seen to be a more accurate measure of
dispersion than the range; that is, the standard deviation is not unduly
biased by a few extreme sample values.

The variance is found by simply squaring the standard deviation.
Consequently, the variance has all the advantages of the standard
deviation. The mathematical form for the variance is

$$\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{X})^2 .$$

An equivalent form for the variance is

$$\sigma^2 = \overline{X^2} - \bar{X}^2 ,$$

where

$$\overline{X^2} = \frac{1}{N} \sum_{i=1}^{N} x_i^2 \quad \text{and} \quad \bar{X}^2 = \left( \frac{1}{N} \sum_{i=1}^{N} x_i \right)^2 .$$

The latter equation is the "short method" for computing $\sigma^2$ and is the
recommended method of computing the variance; the positive square root
of this equation is the "short method" for computing $\sigma$ and is recommended
for computing the standard deviation.

As an example which illustrates the equivalence of the two listed
forms for computing $\sigma^2$, consider the case where three BT
(bathythermograph) readings are observed and the first shows a sea surface
temperature of 50°F, the second 48°F, and the third 52°F. The mean of
these readings is $\bar{X} = \frac{1}{3}(50 + 48 + 52) = 50$. The variance
for these readings is first found by

$$\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{X})^2 = \frac{1}{3} \sum_{i=1}^{3} (x_i - \bar{X})^2$$

$$= \frac{1}{3} \left[ (50 - 50)^2 + (48 - 50)^2 + (52 - 50)^2 \right] = \frac{1}{3}(0 + 4 + 4) = \frac{8}{3} .$$

Now, $\sigma^2$ is found by

$$\sigma^2 = \overline{X^2} - \bar{X}^2 ,$$

where

$$\overline{X^2} = \frac{1}{3} \sum_{i=1}^{3} x_i^2 = \frac{1}{3}(50^2 + 48^2 + 52^2) = \frac{7508}{3}$$

and

$$\bar{X}^2 = \left( \frac{1}{3} \sum_{i=1}^{3} x_i \right)^2 = \left[ \frac{1}{3}(50 + 48 + 52) \right]^2 = 50^2 = \frac{7500}{3} ,$$

so that

$$\sigma^2 = \frac{7508}{3} - \frac{7500}{3} = \frac{8}{3} .$$

(The second method looks longer than the first, but the mean had to be
calculated prior to the first method.) Hence, this example illustrates
the equivalence of the two formulas for $\sigma^2$.
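
The same comparison can be carried out in a few lines of Python. Exact fractions are used so that both forms can be compared without rounding; the variable names are chosen here for illustration.

```python
from fractions import Fraction
import math

readings = [50, 48, 52]          # BT sea surface temperatures, degrees F
n = len(readings)
mean = Fraction(sum(readings), n)

# Defining form: average squared deviation from the mean.
variance_definition = sum((x - mean) ** 2 for x in readings) / n

# "Short method": mean of the squares minus the square of the mean.
mean_of_squares = Fraction(sum(x * x for x in readings), n)
variance_short = mean_of_squares - mean ** 2

# Standard deviation is the positive square root of the variance.
std_dev = math.sqrt(variance_short)
print(variance_definition, variance_short, std_dev)
```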

As an example of the usefulness of the standard deviation, consider
the Normal probability frequency distribution (which is covered in detail
in Chapter VI) shown in Figure 5. If sigma (the standard deviation) is
known for the distribution, the percentage of cases in which any (observed)
value could be expected to have come from the intervals (of the
distribution) $\bar{X} - \sigma$ to $\bar{X} + \sigma$, $\bar{X} - 2\sigma$ to $\bar{X} + 2\sigma$, or
$\bar{X} - 3\sigma$ to $\bar{X} + 3\sigma$ is 68.27%, 95.45%, or 99.73%, respectively, as shown.

Figure 5: Normal Probability Frequency Distributions
(figure labels: $\bar{X} - 2\sigma$ to $\bar{X} + 2\sigma$, 95.45%; 99.73%)
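
These three percentages can be reproduced from the error function in Python's `math` module, using the standard identity that the fraction of a Normal distribution within k standard deviations of its mean is erf(k/√2). The function name is an assumption for illustration.

```python
import math

def fraction_within(k):
    """Fraction of a Normal distribution lying within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))

# Percentages for the 1-, 2-, and 3-sigma intervals, rounded to two decimals.
percentages = [round(100 * fraction_within(k), 2) for k in (1, 2, 3)]
print(percentages)
```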

As an application, consider a situation where it is necessary to
obtain the probability of detecting a submarine at a distance of 45 n.m.
from the detection device (which is denoted by $P_{det}(45\ \text{n.m.})$), given
the FOM (Figure-of-Merit), the pertinent PLP (Propagation Loss Profile),
and the $\sigma$-value for the probability of detection.

The first step is to visualize a Normal probability curve whose
"x-axis" is located vertically along the 45 n.m. line of the PLP, and
whose mean value is on the FOM line along the "x-axis," as seen in
Figure 6.

Figure 6: PLP (Propagation Loss Profile)

Next, recall that the area under the Normal curve to either side of the
FOM (the mean value for the distribution) corresponds to 50% probability
of detection. Note that the $P_{det}(45\ \text{n.m.})$ is represented by the darkened
area under the Normal curve and under the PLP curve at 45 n.m., and that
the intersection of the PLP and the axis of the probability curve is
at -95 dB or $-1\sigma$ relative to the mean (the FOM). Now the $P_{det}(45\ \text{n.m.})$
is calculated as the difference between 50% (the lower half of the area
under the Normal curve) and one-half of the area (under the Normal curve)
from $-1\sigma$ to $+1\sigma$ (which was previously shown to be 68.27%), or

$$P_{det}(45\ \text{n.m.}) = \left[ 50 - \tfrac{1}{2}(68.27) \right]\% = 15.87\% .$$

(Other methods for obtaining probabilities are presented in Chapters IV
through VI.)
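
The area argument above amounts to evaluating the Normal cumulative distribution one standard deviation below the mean. The sketch below computes that area directly and also by the subtraction used in the text; the function name `normal_cdf` is an assumption for illustration, and the two results differ only by the rounding of 68.27%.

```python
import math

def normal_cdf(z):
    """Cumulative probability of a standard Normal variable up to z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2)))

# The PLP crosses the axis of the probability curve one sigma below the FOM.
p_det = normal_cdf(-1.0)

# Same quantity by the arithmetic in the text: 50% minus half of 68.27%.
p_det_text = 0.50 - 0.5 * 0.6827

print(round(100 * p_det, 2), round(100 * p_det_text, 3))
```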

Supplementary solved problems covering special measures of sample
data are presented at the end of Chapters 3 and 4 of Schaum's Outline
Series, Theory and Problems of Statistics. These problems should be
consulted for a further review of the material.

CHAPTER III: COMBINATORIAL ANALYSIS

COMBINATORIAL  ANALYSIS 

Combinatorial  analysis  involves  application  of  counting  principles 
to  obtain  probabilities  of  complex  events.  These  basic  principles  are 
presented  in  this  chapter. 

THE  FUNDAMENTAL  COUNTING  PRINCIPLE 


If an event can happen in any of $n_1$ ways, and if when this has occurred
another event can happen in any of $n_2$ ways, then the number of ways in
which both events can happen is $n_1 n_2$. Example: if any one of 3
submarines is to transit any one of 2 barriers, then the number of ways a
submarine and a barrier can be paired is (2)(3) = 6 ways; if any one of 5
submarines is to transit any one of 3 barriers, then there would be
(3)(5) = 15 ways.
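
The counting principle can be illustrated by listing every pairing explicitly and comparing the count with the product $n_1 n_2$. The submarine and barrier labels below are hypothetical.

```python
from itertools import product

submarines = ["SS-1", "SS-2", "SS-3"]     # hypothetical labels, n_1 = 3
barriers = ["Barrier A", "Barrier B"]     # hypothetical labels, n_2 = 2

# Every (submarine, barrier) pairing, one choice from each list.
pairings = list(product(submarines, barriers))
count = len(pairings)
print(count)
```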


FACTORIAL  n 

Factorial n, denoted by n!, is equal to n(n - 1)(n - 2)...1. Thus
5! = (5)(4)(3)(2)(1) = 120, and 4!3! = ((4)(3)(2)(1))((3)(2)(1)) = 144. It
is convenient to define 0! = 1. Further, n must be a nonnegative integer,
so that each factor in the factorial product is a positive integer.

PERMUTATION  PRINCIPLE 

A permutation is an arrangement of r out of n objects with attention
given to the order of arrangement. The number of permutations of n objects
taken r at a time is denoted by nPr, P(n,r) or $P_{n,r}$ and is given by

$${}_{n}P_{r} = n(n - 1)(n - 2) \cdots (n - r + 1) = \frac{n!}{(n - r)!} .$$

For example, how many ways may 5 destroyers be placed in a 5-position
screen? The first position can be occupied by any one of the 5 destroyers;
that is, there are 5 ways of filling the first position. When this has
been done there are 4 ways of filling the second position. Then there
are 3 ways of filling the third position, 2 ways of filling the fourth
position and 1 way of filling the last position. Therefore, n represents
the 5 destroyers and r represents the number of destroyers that may be
used in the screen, which also is 5; hence,

$${}_{n}P_{r} = \frac{n!}{(n - r)!} = \frac{5!}{(5 - 5)!} = \frac{5!}{0!} = \frac{5!}{1} = (5)(4)(3)(2)(1) = 120 .$$

Now suppose 5 destroyers are available for placement in a 3-position
screen. How many ways may the destroyers be placed in the screen? The
first position can be occupied by any one of the 5 destroyers, then the
second position can be occupied by any of the remaining 4 destroyers and
finally the last position may be occupied by any of the remaining 3
destroyers. Therefore, n represents the number of destroyers, which is 5,
and r represents the number of destroyers that may be used, which is 3; so
that,

$${}_{n}P_{r} = {}_{5}P_{3} = \frac{n!}{(n - r)!} = \frac{5!}{(5 - 3)!} = \frac{5!}{2!} = \frac{(5)(4)(3)(2)(1)}{(2)(1)} = \frac{120}{2} = 60 .$$
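
Both destroyer examples can be checked with `math.perm` (available in Python 3.8 and later), and the 3-position case can also be confirmed by listing the arrangements. The destroyer labels are hypothetical.

```python
import math
from itertools import permutations

destroyers = ["DD-1", "DD-2", "DD-3", "DD-4", "DD-5"]   # hypothetical labels

full_screen = math.perm(5, 5)        # 5 destroyers in a 5-position screen
three_position = math.perm(5, 3)     # 5 destroyers in a 3-position screen

# Cross-check by enumerating every ordered placement of 3 of the 5 destroyers.
enumerated = len(list(permutations(destroyers, 3)))
print(full_screen, three_position, enumerated)
```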

COMBINATION  PRINCIPLE 

A combination is a selection of r out of n objects without counting
the order of arrangement. The number of combinations of n objects taken
r at a time is denoted by nCr, C(n,r), $C_{n,r}$ or $\binom{n}{r}$ and is given by

$${}_{n}C_{r} = \frac{n(n - 1) \cdots (n - r + 1)}{r!} = \frac{n!}{r!(n - r)!} = \frac{{}_{n}P_{r}}{r!} .$$

For example: Suppose a submarine with six torpedoes approaches a
convoy of 10 ships. If the ships are selected at random and one torpedo
is shot at each selected ship, how many ways may the torpedoes be shot at
the 10 ships?

$${}_{n}C_{r} = \frac{n!}{r!(n - r)!} = \frac{10!}{6!(10 - 6)!} = \frac{(10)(9)(8)(7)(6)(5)(4)(3)(2)(1)}{((6)(5)(4)(3)(2)(1))((4)(3)(2)(1))}$$

$$= \frac{3,628,800}{(720)(24)} = \frac{3,628,800}{17,280} = 210 \text{ ways} .$$

Now suppose 3 of the ships are warships protecting the convoy and it is
desired to keep clear of these ships. If the remaining 7 ships are
selected at random and one torpedo is shot at each selected ship, how many
ways may the torpedoes be shot at the 7 ships?

$${}_{n}C_{r} = \frac{n!}{r!(n - r)!} = \frac{7!}{6!(7 - 6)!} = 7 \text{ ways} .$$
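
Both torpedo examples can be checked with `math.comb` (Python 3.8 and later). The last line of the sketch also checks the relation nCr = nPr / r! for the 10-ship case.

```python
import math

ten_ships = math.comb(10, 6)      # choose 6 targets from 10 ships
seven_ships = math.comb(7, 6)     # choose 6 targets from the 7 non-warships

# Relation between combinations and permutations: nCr = nPr / r!
from_permutations = math.perm(10, 6) // math.factorial(6)
print(ten_ships, seven_ships, from_permutations)
```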

Supplementary solved problems covering permutations and combinations
are presented at the end of Chapter VI of Schaum's Outline Series,
Theory and Problems of Statistics. These problems should be consulted
for further review of the material.

CHAPTER IV: ELEMENTARY PROBABILITY THEORY

ELEMENTARY  PROBABILITY  THEORY 

The concept of probability is commonly used as a means with which to
state (or predict) the percentage of possible cases in which a specific
event will occur; for example, the percentage of cases in which a specific
number will appear on a single toss of a die. The probability of such an
event occurring is, in many cases (those in which all outcomes are equally
likely), given as the ratio of the number of ways in which the event can
occur to the number of ways in which ALL events can occur. So the
probability that a two, for example, will occur on a single toss of a
fair die is

$$P(2) = \frac{1}{6} ,$$

where the numerator represents the one way in which a two can occur, and
the denominator represents the six ways in which each of the numbers
1, 2, 3, 4, 5, 6 can occur; hence, the probability that a two will be
the outcome of a single toss of a die is one-sixth, or in other words,
the "chances" of this outcome (an event) occurring are "one in six."

Another method of specifying the probability of an event is to state
the probability in terms of proportional areas. The basis of this method
is to think of the probability of occurrence of ALL events as an area,
say "A," so that the probability of a particular event is simply the ratio
of the areas

$$P(\text{event of interest}) = \frac{B}{A} ,$$

where B is the area corresponding to the event of interest.

Probabilities specified as proportional areas can be represented by
diagrams of the corresponding regions. A diagram of two regions with
areas "A" and "B" is shown in Figure 7.

Figure 7: Proportional Areas for Probability Analogy

This  figure  facilitates  understanding  the  probability  of  event  "B"  by 
illustrating  the  area  which  is  assigned  to  a  region  relative  to  the 
total  area,  "A,"  assigned  to  the  occurrence  of  ALL  events.   Since  the 
diagram  visually  represents  the  probability  of  the  event,  it  is  a  useful 
means  of  intuitively  developing  the  concept  of  probability.   (A  carefully 
cultured  intuition  can  be  very  useful  when  dealing  with  probability.) 

It  is  necessary  here  to  define  some  terms  which  will  make  the 
formulation  of  basic  probability  theory  easier.   The  first  of  these  terms 
is  PROBABILITY  EXPERIMENT,  which  is  defined  as  a  type  of  trial  procedure 
for  selecting  a  specific  outcome.   An  example  is  that  of  tossing  a  die 
to  see  what  number  comes  up  and  noting  the  occurrences  (or  of  flipping 
a  coin  to  see  whether  "heads"  or  "tails"  occurs). 

Next, a SAMPLE SPACE is the totality of outcomes which can occur
for the probability experiment. In terms of plane regions, the sample
space is the large region ("A" in the previous example) which contains
all possible outcomes of a specific probability experiment.

An OUTCOME is a specific value (or set of values) resulting from a
single iteration of a probability experiment, and it is the basic
element (or smallest unit) of the sample space (for the specific
probability experiment). An example would be the occurrence of one
specific ordered sequence (of the 216 possible sequences) of the numbers
appearing on the three dice in the (probability) experiment of tossing
three dice on a single toss and noting the numbers which appear, where
the goal of the experiment is to see whether or not a one, a two, and a
three appear on the toss. This can be represented as one of the totality
of points that make up the sample space (a three dimensional space, as
each outcome is composed of three numbers).

An EVENT, on the other hand, is a set of outcomes; in the previous
example, an outcome is the sequence of the three specific numbers which
occur as a result of the toss of the three dice, and the event consists
of six points of interest (outcomes) in 3-space, as shown in Table 4.

      Die #1      Die #2      Die #3
     (   1    ,     2     ,     3   )
     (   1    ,     3     ,     2   )
     (   2    ,     1     ,     3   )
     (   2    ,     3     ,     1   )
     (   3    ,     1     ,     2   )
     (   3    ,     2     ,     1   )

Table 4: The Six Outcomes That Comprise the Event of Obtaining a One,
a Two, and a Three on a Single Toss of Three Dice

In terms of the proportional areas analogy, an event is an area ("B" in
the earlier example) which contains one or more outcomes and is a
sub-area of the sample space ("A"). An extension of this terminology is the
term JOINT EVENTS, which describes a combined event in which two or more
simple events occur on the same iteration of the experiment.

In summary, the sample space is comprised of all possible outcomes
of a probability experiment, and these outcomes are grouped into sets
called events; outcomes are the elements of an event.
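
The three-dice experiment is small enough to enumerate completely, which makes the distinction between sample space, outcome, and event concrete. The sketch below assumes fair dice, so that every outcome is equally likely; the probability it computes follows from the counting ratio and is not a figure stated in the text.

```python
from fractions import Fraction
from itertools import product

# Sample space: every ordered result (Die #1, Die #2, Die #3) of one toss.
sample_space = list(product(range(1, 7), repeat=3))

# Event: a one, a two, and a three appear, in any order on the three dice.
event = [outcome for outcome in sample_space if sorted(outcome) == [1, 2, 3]]

# With equally likely outcomes, probability is the ratio of the two counts.
probability = Fraction(len(event), len(sample_space))
print(len(sample_space), len(event), probability)
```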

The  remaining  basic  concept  in  the  theory  is  that  of  understanding 
the  PROBABILITY  FUNCTION.   A  probability  function  is  a  device  by  which 
a  number  (or  area)  in  the  sample  space  is  assigned  (as  previously 
mentioned)  to  an  event.   An  example  of  a  probability  function  is  the 
assignment  of  the  value  1/3  to  the  probability  of  each  of  the  events 
that  a  single  submarine,  during  a  single  transit,  will  cross  one  of 
three  barriers  each  of  which  the  submarine  is  equally-likely  to  cross. 
This  is  written  as 

P(SS  transits  #1)  =  1/3 

P(SS  transits  #2)  =  1/3 

P(SS  transits  #3)  =  1/3  , 
which can also be represented in an area diagram as in Figure 8, where it
is seen that all three areas are equal and that no area overlaps any
other, which is interpreted to mean that the submarine can cross only
one of the barriers during a single transit.

Figure 8: Proportional Area Depiction of Equal-Likelihood Probability Barriers
(figure labels: Area 1 = 1/3; Area 2 = 1/3; Area 3 = 1/3)


There are basic restrictions on the probability function. One of
these is that the values assigned by the probability function to a set
of n events which do not overlap, and which together make up the entire
sample space, must sum to one, or

$$\sum_{i=1}^{n} p_i = 1 ,$$

where $p_i$ is the probability assigned to the ith event.

Another is that the values assigned must be between zero and one, or

$$0 \le p_i \le 1 .$$

This restriction is a result of the fact that probability is a ratio of
a small area to a larger one. Also, since a specific event in a given
experiment will either occur or not occur, the sum of the probabilities
that the event will and the event will not occur must equal one. This is
represented mathematically as

P(Specific event occurs) + P(Specific event does not occur) = 1

or by rearrangement

P(Specific event does not occur) = 1 - P(Specific event occurs)

or

P(Specific event occurs) = 1 - P(Specific event does not occur).

As seen by the last two arrangements of the first equation, the probability
of an event occurring and the probability of the same event not occurring
on the same trial of an experiment are complements of each other in the
probability sense; that is, the probability of an event occurring can
also be described by one minus the probability of the same event not
occurring. As will be seen later, this idea of PROBABILITY COMPLEMENT
of the occurrence of an event has many useful computational applications.
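
These restrictions can be checked mechanically for the three-barrier example. The function name and dictionary keys below are assumptions made for illustration, and exact fractions are used so that the sum is exactly one.

```python
from fractions import Fraction

# Probability function from the three-barrier example in the text.
p = {
    "barrier 1": Fraction(1, 3),
    "barrier 2": Fraction(1, 3),
    "barrier 3": Fraction(1, 3),
}

def is_valid_probability_function(probabilities):
    """Check that every p_i lies between 0 and 1 and that the p_i sum to one."""
    values = list(probabilities.values())
    return all(0 <= v <= 1 for v in values) and sum(values) == 1

valid = is_valid_probability_function(p)

# Complement: P(SS does not transit #1) = 1 - P(SS transits #1).
p_not_barrier_1 = 1 - p["barrier 1"]
print(valid, p_not_barrier_1)
```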

We now pass from probabilities for single events to probabilities
for several events. There are two cases of interest: the probability
of joint events, and the probability of at least one of two (or more)
events. Joint events are denoted by the word "and"; for example, the
probability of events A and B occurring. The other case is denoted by
the word "or"; for example, the probability of event A or B occurring
(where "or" is used in this course in the sense of A or B or both).
This "or" probability is shown by the proportional areas analogy as in
Figure 9.

Figure 9: Probability of Events A "or" B


It  is  seen  from  this  figure  that  the  probability  of  event  A  or  B  is 
represented  as  the  probability  of  A  plus  the  probability  of  B,  minus 
the  probability  of  A  and  B  occurring.   The  reason  for  subtracting  the 
joint  probability  of  A  and  B  is  that  the  area  corresponding  to  the  joint 
probability ("cross-hatched" in the figure) is common to both event A and
event B, so that when area A is added to area B the "cross-hatched" area
is added twice; therefore, the joint probability must subsequently be
subtracted once. The probability of A or B occurring is therefore expressed
mathematically as

P(A or B) = P(A + B) = P(A) + P(B) - P(A and B)

          = P(A) + P(B) - P(A • B).
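
The addition rule can be checked by direct counting on a small sample space. The sketch below is illustrative only: the pair of dice and the events A ("first die shows 6") and B ("second die shows 6") are assumptions chosen for the example and do not come from the text.

```python
from fractions import Fraction
from itertools import product

# Hypothetical sample space: the 36 equally likely outcomes of two dice.
space = list(product(range(1, 7), repeat=2))

def prob(event):
    """Ratio of favorable outcomes to all outcomes (the proportional-areas idea)."""
    return Fraction(sum(1 for o in space if event(o)), len(space))

def A(o):
    return o[0] == 6          # assumed event A: first die shows 6

def B(o):
    return o[1] == 6          # assumed event B: second die shows 6

p_a, p_b = prob(A), prob(B)
p_a_and_b = prob(lambda o: A(o) and B(o))
p_a_or_b = prob(lambda o: A(o) or B(o))

# P(A or B) = P(A) + P(B) - P(A and B)
assert p_a_or_b == p_a + p_b - p_a_and_b
print(p_a_or_b)   # 11/36
```

Counting the "or" event directly and applying the formula give the same value because the single outcome (6,6) would otherwise be counted twice.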

Figure 10 shows a case in which events A and B have no area of overlap;
that is, A and B do not have any area in common.


Figure 10:  Probability of Events A "or" B when
Events A and B are Mutually Exclusive


It  is  easily  seen  from  this  figure  that  any  outcome  in  event  A  cannot 
possibly  also  be  a  part  of  event  B;  in  fact,  the  existence  of  the  outcome 
in  either  A  or  B  precludes  its  simultaneous  existence  in  the  other  event. 
Two  such  events  are  said  to  be  MUTUALLY  EXCLUSIVE. 

The  joint  probability  of  A  and  B  can  be  shown  through  the  proportional 
areas  analogy  as  in  Figure  11. 


Figure 11:  The Joint Probability of Events A and B

It is seen from the figure that the probability of the joint event is
the  ratio  of  the  "cross-hatched"  area  (the  joint  event  itself)  to  the 
entire  area  (which  is  equal  to  one  since  it  is  a  probability  space). 
Mathematically  the  joint  probability  of  events  A  and  B  is  represented  as 

P(A and B) = P(A • B) = P(A) + P(B) - P(A + B).
(This  is  simply  a  rearrangement  of  the  "or"  probability  equation  developed 
previously.) 

The symbol P(B|A) denotes the CONDITIONAL PROBABILITY of event B
given that event A has already occurred; that is, it is the probability
of B on the condition that event A has occurred previously. It is seen
from Figure 12 that P(B|A) differs from P(B) by the fact that the
sample  space  (S)  from  which  B  is  drawn  is  larger  than  the  resultant 
sample  space  (S ' )  from  which  B  is  drawn  when  A  has  been  given  to  occur 
on  this  same  outcome  (the  event  "B  given  A").   This  is  due  to  the  fact 
that  S  has  been  reduced  to  S'  after  it  is  known  that  the  event  yet  to 
occur  (B)  is  contained  in  the  event  that  has  already  occurred  (A) ; 
therefore,  in  terms  of  the  ratio  idea,  the  denominator  has  been  decreased, 
thereby  increasing  the  probability  of  B  for  the  conditional  case  (relative 
to  the  unconditioned  case). 


Figure 12:  Comparison of Unconditioned Probability to
Conditional (or Conditioned) Probability (S' = A)


CONDITIONAL  PROBABILITY  actually  epitomizes  the  usefulness  of  probability 
to ASW. That is, generally, an area is searched for the presence of a
submarine, and both the cases in which the submarine 1) has or 2) has not
been  detected  as  a  result  of  previous  search  efforts  are  important 
information.   Each  allows  the  searcher  to  refine  his  prediction  of  the 
target's  position  (in  most  cases)  to  a  smaller  and  smaller  portion  of 
the  original  search  region  and,  thereby,  increase  the  probability  of 
detecting  the  submarine  on  subsequent  portions  of  the  search.   It  is 
obvious  how  this  occurs  in  the  case  where  the  target  is  detected  on  a 
given  previous  effort,  but  the  case  of  no  detection  on  previous  search 
efforts  may  not  be  so  obvious.   The  fact  that  the  submarine  was  not 
detected  on  previous  portions  of  the  search  allows  the  searcher  to 
eliminate (again, in most cases) that portion which has been searched
(similar in idea to "sterilization"), thereby conditioning the original
sample  space  (this  should  be  visualized  as  multiple  iterations  of  the 
process  depicted  in  Figure  12) .   Having  previous  knowledge  (intelligence 
data)  which  allows  the  searcher  to  eliminate  portions  of  the  (original) 
search  area  is  another  way  in  which  this  conditioning  can  occur. 

The  joint  probability  of  events  A  and  B  (discussed  previously)  can  be 
represented  as 

P(A and B) = P(B|A) • P(A).
This  equation  has  more  intuitive  appeal  than  the  previous  equation  for 
joint  probability,  as  it  is  simply  an  extension  of  the  "Counting  Principle"; 
it  indicates  that  the  probability  of  both  event  A  and  B  occurring  is  the 
probability  that  event  A  occurs  and  "event"  B  given  A  occurs. 

From the last equation it is seen that if P(A) ≠ 0, then the
conditional probability of B given A is mathematically represented as

P(B|A) = P(A • B) / P(A).

Three important methods for computing conditional probabilities are:

1.  By  utilizing  the  definition  of  conditional  probability 

P(B|A)  =  P(A  •  B)  /  P(A)  . 

For example, if it is known (by some means or another) that a submarine
skipper will not choose one of three known and equally likely barriers
to transit, then the probability of each remaining choice is increased
beyond its original value, because the sample space has been reduced
from three possible paths to two. If the barrier eliminated is #1, then
the resulting sample space S' consists of barriers #2 and #3, vice the
original sample space S, which was made up of barriers #1, #2, and #3.
Therefore, with A the event "#2 or #3" and B the event "#2", the
probability that the submarine will transit barrier #2, given that it
will transit either #2 or #3, is seen to be

P(B|A) = P(A • B) / P(A) = P[(#2 or #3) and (#2)] / P(#2 or #3)

       = P(#2) / P(#2 or #3)      (since the only way in which #2 and
                                  (#2 or #3) can both occur is if #2 occurs)

       = (1/3) / (1/3 + 1/3) = (1/3) / (2/3) = 1/2.
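
The barrier computation can be reproduced with a few lines of code. This is a minimal sketch: the helper name `conditional` and the representation of events as sets of barrier numbers are assumptions made for illustration, while the three equally likely barriers come from the example above.

```python
from fractions import Fraction

# Three equally likely barriers, as in the example.
p = {1: Fraction(1, 3), 2: Fraction(1, 3), 3: Fraction(1, 3)}

def conditional(p, target, given):
    """P(target | given) = P(target and given) / P(given) for sets of barriers."""
    p_given = sum(p[b] for b in given)
    p_joint = sum(p[b] for b in target & given)
    return p_joint / p_given

# Barrier #1 has been ruled out, so the conditioning event is {#2, #3}.
print(conditional(p, {2}, {2, 3}))   # 1/2
```

Replacing the equal weights in `p` with unequal ones shows how the same definition handles barriers that are not equally likely.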

2.  By  simply  writing  the  conditional  probability  down,  as  sometimes 
the  information  that  A  has  occurred  enables  the  determination  of  the 
probability  of  B. 

For  example,  in  the  experiment  of  drawing  two  cards  from  a  deck  of 
playing  cards  one  at  a  time  without  replacing  the  drawn  card  in  the  deck, 
it  is  known  that  the  first  card  drawn  is  a  Spade.   Given  that  the  first 
card  is  a  Spade,  the  probability  that  the  second  card  is  also  a  Spade 
can  immediately  be  written  down  as 

P(draw #2 is a Spade | draw #1 is a Spade) = 12/51,

since the original 13 Spades in the deck have been reduced to 12, and
the original 52 total cards (the sample space) to 51 by the first draw.
The probability thus formed is consistent with the ratio concept of
probability.
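
That the written-down value agrees with the ratio concept can be confirmed by brute-force counting. The sketch below enumerates every ordered two-card draw; the deck encoding (rank numbers and suit letters) is an assumption made only for this illustration.

```python
from fractions import Fraction
from itertools import permutations

# Build a 52-card deck; only the suit matters for this question.
deck = [(rank, suit) for suit in "SHDC" for rank in range(1, 14)]

# All ordered two-card draws without replacement (52 * 51 outcomes).
draws = list(permutations(deck, 2))

# Reduced sample space: draws whose first card is a Spade.
first_spade = [d for d in draws if d[0][1] == "S"]
both_spades = [d for d in first_spade if d[1][1] == "S"]

p_second_given_first = Fraction(len(both_spades), len(first_spade))
print(p_second_given_first)   # 4/17, which equals 12/51
```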

3.  By utilizing Bayes' law, which will now be presented without
proof and with only the following restriction.

Bayes' law holds for experiments on sample spaces which consist of
definite groups of events that do not "overlap." Such a sample space is
seen to correspond to the delimited areas in Figure 13.


Figure 13:  A Partitioned Sample Space


Such a sample space is said to be PARTITIONED.

Bayes' law states that on a partitioned space the probability of an
event A occurring, given that event B has previously occurred, is

P(A|B) = [P(B|A) • P(A)] / [P(B|A) • P(A) + P(B|A^c) • P(A^c)],

where P(A^c) is the probability complement (covered earlier) of P(A), or
P(A^c) = 1 - P(A).

For  example,  suppose  it  is  known  from  intelligence  reports  that 
missiles  of  type  C  and  D  are  carried  by  exactly  two  classes  of  Soviet 
submarines,  say  class  A  and  class  B.   Submarines  of  class  A  are  known 
to  carry  16  missiles  of  type  C  and  eight  of  type  D,  while  those  of 
class  B  are  known  to  carry  ten  of  each.   A  missile  is  fired  from  a 
submarine  and  observed  to  be  of  type  C.   Find  the  probability  that  the 
missile was fired from a class A sub, under the following conditions:

1.  Each missile to be fired is chosen at random from the total
supply of missiles aboard the submarine, and

2.  the Soviets normally operate twice as many subs of class B as
of class A in the area.

The problem is to find P(A|C); it can be found using Bayes' law. To
determine the values to be used, it is best to first draw two blocks
representing the class A and B submarines and the types of missiles they
carry as shown in Figure 14. (Remember that there are two class B subs
for each class A.)


Figure 14


By Bayes' law then

P(A|C) = [P(C|A) • P(A)] / [P(C|A) • P(A) + P(C|A^c) • P(A^c)];

here A^c = B; therefore,

P(A|C) = [P(C|A) • P(A)] / [P(C|A) • P(A) + P(C|B) • P(B)]

       = [(16/24) • (1/3)] / [(16/24) • (1/3) + (10/20) • (2/3)]

       = (2/9) / (5/9) = 2/5.
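
The missile computation can be written as a small function for the two-part partition {A, A^c}. This is a sketch under the assumptions of the example; the function name `bayes` and its argument names are illustrative choices, not notation from the text.

```python
from fractions import Fraction

def bayes(prior_a, like_a, like_not_a):
    """P(A|B) on the partition {A, A^c}, given P(A), P(B|A) and P(B|A^c)."""
    numerator = like_a * prior_a
    return numerator / (numerator + like_not_a * (1 - prior_a))

p_class_a = Fraction(1, 3)        # twice as many class B subs as class A
p_c_given_a = Fraction(16, 24)    # class A carries 16 type C out of 24 missiles
p_c_given_b = Fraction(10, 20)    # class B carries 10 type C out of 20 missiles

print(bayes(p_class_a, p_c_given_a, p_c_given_b))   # 2/5
```

Exact fractions are used so that the result matches the hand computation without rounding.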

As a second example of the application of Bayes' law, consider a
circuit breaker used to protect a certain piece of ASW hardware from
short circuits. At the moment the "on" switch is thrown to activate
the equipment, the circuit breaker essentially "tests" the circuit for a
short. If a short is detected, the circuit breaker should immediately
trip; otherwise it should allow operation to continue.



Suppose  it  is  known  that  although  short  circuits  do  not  always  trip 
the circuit breaker, they do trip it 99.5% of the time. On the other
hand,  suppose  the  probability  that  the  circuit  breaker  trips  when  there 
is  no  short  is  3%.   If  the  probability  that  there  is  a  short  in  the 
equipment  when  the  switch  is  thrown  is  0.01,  find  the  conditional 
probability  that  there  is  no  short  circuit  given  that  the  circuit  breaker 
does  not  trip. 

The technique of solution involves several steps:  1) describe
mathematically the probability "events" involved; 2) formulate what is
given and immediate consequences of the given; 3) formulate the unknown;
4) write down Bayes' law as it applies to the events of this problem;
5) substitute the given numbers and solve.

1)  Let:  T be the event that the circuit breaker trips;

T^c the event that the circuit breaker does not trip;

S the event that there is a short circuit when the "on"
switch is thrown;

S^c the event that there is no short when the "on"
switch is thrown.

2)  We are given that

P(T|S) = 0.995,  P(T|S^c) = 0.03,  P(S) = 0.01.

From the given it follows immediately that

P(T^c|S) = 0.005,  P(T^c|S^c) = 0.97,  P(S^c) = 0.99.

3)  The problem is to find

P(S^c|T^c).

4)  By Bayes' law,

P(S^c|T^c) = [P(T^c|S^c) • P(S^c)] / [P(T^c|S^c) • P(S^c) + P(T^c|S) • P(S)].

5)  Substitute:

P(S^c|T^c) = [(0.97)(0.99)] / [(0.97)(0.99) + (0.005)(0.01)]

           = 0.9603 / (0.9603 + 0.00005) = 0.999948.
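
The same answer can be obtained by tabulating the joint probability of every (short, trip) combination and then conditioning on "no trip." The sketch below uses only the three numbers given in the problem; the dictionary layout and variable names are assumptions made for illustration.

```python
# Values given in the problem statement.
p_short = 0.01
p_trip_given_short = 0.995
p_trip_given_no_short = 0.03

# Joint probabilities of the four (short?, trip?) combinations.
joint = {
    ("short", "trip"): p_trip_given_short * p_short,
    ("short", "no trip"): (1 - p_trip_given_short) * p_short,
    ("no short", "trip"): p_trip_given_no_short * (1 - p_short),
    ("no short", "no trip"): (1 - p_trip_given_no_short) * (1 - p_short),
}

# Condition on "no trip": keep only that column and renormalize.
p_no_trip = joint[("short", "no trip")] + joint[("no short", "no trip")]
p_no_short_given_no_trip = joint[("no short", "no trip")] / p_no_trip
print(round(p_no_short_given_no_trip, 6))   # 0.999948
```

Laying out the four joint probabilities first makes it easy to answer the companion questions (for example, the probability of a short given that the breaker trips) from the same table.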


STATISTICAL  INDEPENDENCE 

Another  important  and  useful  concept  is  that  of  STATISTICAL 
INDEPENDENCE.  Two events are called statistically independent if
knowledge of the occurrence of one does not affect the probability of
occurrence of the other. What this means for events A and B, where
P(A) ≠ 0 and P(B) ≠ 0, is that P(B|A) = P(B) and P(A|B) = P(A). From
conditional probability it is known that

P(B|A) = P(B and A) / P(A),   or

P(B and A) = P(B • A) = P(B|A) • P(A),

but since P(B|A) = P(B), it is said that A and B are statistically
independent if

P(A • B) = P(A) • P(B).

For example, if an air screen barrier and a destroyer barrier are
placed  about  a  convoy  in  such  a  manner  that  the  probability  of  detection 
on  one  of  the  screens  does  not  affect  the  probability  of  detection  on 
the  other  screen,  then  the  joint  probability  of  detection  of  the  submarine 
in  both  screens  is  simply  the  product  of  the  probability  of  detection 
from  each  of  the  screens.   Therefore,  if  the  air  screen  barrier  has  a 
0.50  probability  of  detection  and  the  destroyer  screen  barrier  has  a 
0.30  probability  of  detection,  then  the  probability  that  the  submarine 
will  be  detected  on  both  screens  is 

P(Detection on both air screen barrier and destroyer screen barrier)
     = (0.50)(0.30) = 0.15.

This probability is, of course, contingent upon the submarine attempting
to  penetrate  the  screens.   (The  fact  that  this  joint  probability  is 
smaller  than  either  of  the  (marginal)  probabilities  which  comprise  it 
may,  at  first,  seem  intuitively  incorrect;  that  is,  it  is  easy  to 
misinterpret  the  statement  of  what  probability  is  being  sought  in  such 
a  problem.   However,  when  it  is  realized  that  what  is  sought  here  is 
the  probability  that  both  platforms  simultaneously  hold  contact  on  the 
submarine,  the  smaller  resultant  (joint)  probability  then  seems  more 
intuitively  consistent.) 
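
The contrast between the "both screens" probability and the larger "at least one screen" probability can be made concrete. In the sketch below the two detection probabilities are those of the example; the "at least one" computation is an added illustration that combines the independence rule with the addition and complement rules covered earlier.

```python
p_air = 0.50          # air screen detection probability (from the example)
p_destroyer = 0.30    # destroyer screen detection probability (from the example)

# Independence: the joint probability is the product of the marginals.
p_both = p_air * p_destroyer

# Addition rule and complement rule give the same "at least one" probability.
p_either = p_air + p_destroyer - p_both
p_either_via_complement = 1 - (1 - p_air) * (1 - p_destroyer)

print(round(p_both, 2), round(p_either, 2))   # 0.15 0.65
```

The joint probability is smaller than either marginal, while the probability of detection by at least one screen is larger than either, which is usually the figure of operational interest.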

A  major  distinction  between  statistically  independent  events  and 
mutually  exclusive  events  is  that  the  joint  probability  of  the  mutually 
exclusive  events  is  zero,  since  the  occurrence  of  one  of  a  group  of 
mutually  exclusive  events  precludes  the  (simultaneous)  occurrence  of 
any  other  event  in  that  group.   The  joint  probability  of  independent 
events,  on  the  other  hand,  is  clearly  not  zero  unless  at  least  one  of 
the independent events has zero probability of occurrence. If A and B
are independent, then

P(A and B) = P(A) • P(B) = 0

if and only if P(A) = 0 or P(B) = 0.

Supplementary  solved  problems  covering  elementary  probability  theory 
are  presented  at  the  end  of  Chapter  6  of  Schaum's  Outline  Series,  Theory 
and  Problems  of  Statistics  and  at  the  end  of  Chapters  3  and  4  of  Schaum's 
Outline  Series,  Theory  and  Problems  of  Probability.   These  problems  should 
be  consulted  for  a  further  review  of  the  material  presented  in  this 
chapter.


CHAPTER V:  RANDOM VARIABLES

RANDOM  VARIABLES 

A variable which assumes a numerical value determined by the
outcome of a probability experiment is called a random variable. Actually
a random variable is a shorthand notation for listing each possible
outcome of an experiment. For example, if a coin is tossed two times and
the number of heads is counted, then the random variable may assume the
value 0, 1, or 2.

There are two types of random variables, discrete and continuous. A
discrete random variable is a random variable X on a sample space which
has a finite or countably infinite number of outcomes; say, X(S) =
{x_1, x_2, ..., x_n}. We can make X(S) into a probability space (conditions
on this are listed with the probability distribution below) by defining
the probability of x_i to be P(X = x_i), which may also be written f(x_i).
This function f on X(S), defined by f(x_i) = P(X = x_i), is called the
distribution or probability function of X and is usually given in the
form of a table, as shown in Table 5. (It should be noted that in the
discrete case the distribution is the probability function developed
in Chapter IV.)


x_i     |  x_1      x_2      ...    x_n
f(x_i)  |  f(x_1)   f(x_2)   ...    f(x_n)

Table 5:  Distribution Function of X

The distribution f satisfies the conditions:

f(x_i) ≥ 0   and   Σ f(x_i) = 1,

where the sum runs over i = 1, ..., n.


For  example:   suppose  a  pair  of  fair  dice  is  tossed.   We  obtain  the 
finite  equiprobable  space  S  consisting  of  the  36  ordered  pairs  of 
numbers  between  1  and  6: 

S = {(1,1), (1,2), (1,3), ..., (6,6)}.

Let X assign to each point (a,b) in S the maximum of a and b; that is,
X(a,b) = max(a,b), so that X(S) = {1, 2, 3, 4, 5, 6}. We compute the
distribution f of X:

f(1) = P(X = 1) = P({(1,1)}) = 1/36
f(2) = P(X = 2) = P({(2,1), (2,2), (1,2)}) = 3/36
f(3) = P(X = 3) = P({(3,1), (3,2), (3,3), (2,3), (1,3)}) = 5/36

and similarly f(4) = P(X = 4) = 7/36, f(5) = P(X = 5) = 9/36 and
f(6) = P(X = 6) = 11/36. The information is put into the form of a
table as shown in Table 6.


x_i     |  1      2      3      4      5      6
f(x_i)  |  1/36   3/36   5/36   7/36   9/36   11/36

Table 6:  Distribution f of X
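
The entries of Table 6 can be generated by enumerating the sample space rather than listing outcomes by hand. The following is a minimal sketch; the variable names are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# The 36 equally likely ordered pairs (a, b).
space = list(product(range(1, 7), repeat=2))

# X(a, b) = max(a, b); count how many outcomes map to each value of X.
counts = Counter(max(a, b) for a, b in space)
f = {x: Fraction(counts[x], len(space)) for x in sorted(counts)}

for x in sorted(counts):
    print(f"f({x}) = {counts[x]}/36")

# A distribution must be nonnegative and sum to one.
assert all(p >= 0 for p in f.values())
assert sum(f.values()) == 1
```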


It can also be seen that f(x_i) ≥ 0 and that

Σ f(x_i) = 1/36 + 3/36 + 5/36 + 7/36 + 9/36 + 11/36 = 36/36 = 1,

where the sum runs over i = 1, ..., 6.

A continuous random variable is one in which the set of numbers
X(S) is an interval or a set of intervals. If the interval (a ≤ X ≤ b)
is an event in S, then its probability is P(a ≤ X ≤ b), as
shown in Figure 15.


Figure 15:  P(a ≤ X ≤ b) = Area of Shaded Region


Here P(a ≤ X ≤ b) corresponds to the area under the graph of f between
x = a and x = b. In the language of calculus, P(a ≤ X ≤ b) = ∫_a^b f(x) dx,
which in effect can be approximated by adding up areas under the curve
between a and b. For example, take the area shown in Figure 15 and
partition it into smaller areas as shown in Figure 16a and b, and then
sum the areas of the resulting rectangles.


Figure 16a


Figure 16b


An approximation of the area of each subregion of Figure 16a and 16b
can  now  be  made  by  taking  the  area  of  the  rectangle  which  has  the  same 
base  as  the  subregion  and  height  equal  to  the  minimum  height  for  that 
subregion.   By  adding  up  the  areas  of  the  approximating  rectangles,  an 
approximation  is  now  obtained  for  the  area  of  the  total  region  between 
a  and  b.   The  area  that  has  not  been  included  is  the  shaded  area,  which 
makes the results only an approximation. It can be seen that, due
to  the  increased  number  of  partitions  each  with  a  smaller  base  of 
Figure  16b,  the  shaded  area  is  smaller,  which  results  in  a  better 
approximation.   Actually  the  greater  the  number  of  partitions,  the 
smaller  the  shaded  area,  and  the  better  the  approximation.   If  f(x)  is 
known,  the  exact  area  between  a  and  b  may  easily  be  determined  by  using 
calculus.   However,  it  should  be  noted  that  the  use  of  calculus  is  not 
required  for  most  practical  cases  of  interest  in  this  course,  as  values 
are  obtained  from  standard  tables.   Standard  tables  exist  for  most 
continuous random variable distributions used in ASW applications;
therefore, it is not intended that the student have a working knowledge of
calculus. The following example is given, therefore, simply to provide an
appreciation of calculus:

Let  X  be  a  continuous  random  variable  with  the  following  distribution: 


f(x) = x/2   if 0 ≤ x ≤ 2
f(x) = 0     elsewhere


Figure 17:  Graph of f


P(1 ≤ X ≤ 1.5) = ∫_1^1.5 f(x) dx = ∫_1^1.5 (1/2) x dx

               = [x^2 / 4] evaluated from x = 1 to x = 1.5

               = 0.5625 - 0.25 = 0.3125,


where  it  is  seen  that  the  calculus  operation  of  integration  has  been 
utilized  to  "sum"  the  area  of  the  region  under  the  curve  between  1  and 
1.5  as  shown  in  Figure  17. 
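
The rectangle approximation described earlier and the exact integral can be compared numerically. The sketch below is illustrative: the function name `lower_sum` and the chosen numbers of partitions are assumptions, while the density f(x) = x/2 and the interval from 1 to 1.5 are those of the example.

```python
def f(x):
    """Density from the example: f(x) = x/2 on [0, 2], 0 elsewhere."""
    return x / 2 if 0 <= x <= 2 else 0.0

def lower_sum(a, b, n):
    """Sum of n rectangles on [a, b], each using the minimum height of its strip.

    f is increasing on [0, 2], so the minimum of each strip is at its left edge.
    """
    width = (b - a) / n
    return sum(f(a + k * width) * width for k in range(n))

exact = 1.5**2 / 4 - 1**2 / 4     # antiderivative x^2/4 evaluated at 1.5 and 1

# More partitions leave less uncovered area, so the sum approaches the exact value.
for n in (2, 10, 1000):
    print(n, round(lower_sum(1, 1.5, n), 4))
print("exact", exact)              # exact 0.3125
```

Each approximation falls short of the exact value because the minimum-height rectangles omit the small triangular areas under the curve, and the shortfall shrinks as the number of partitions grows.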

DENSITY  FUNCTION 

The  function  that  describes  the  probability  of  each  interval  on  the 
x-axis  (for  example,  the  bimodal  function,  f(x),  in  Figure  15)  is  called 
a  continuous  probability  function  or  simply  the  (probability)  density 
function  of  X.   The  density  function  must  satisfy  the  constraints  that 
f(x) ≥ 0 for all x, and that the total area under the density function
must  equal  1.   Special  notice  should  be  given  the  fact  that  the 
probability  of  a  point-value  of  the  random  variable  X  is  strictly  equal 
to  zero,  even  though  the  value  of  f  at  the  point  x  may  be  non-zero.   An 
intuitively  appealing  explanation  of  why  the  probability  of  a  single 
outcome  of  a  continuously  distributed  random  variable  is  zero  is  given 
in  the  following  example. 

Consider  an  attack  submarine  (SSN)  searching  for  enemy  submarines. 
The  probability  that  an  initial  detection  bearing  will  be  exactly  some 
prescribed  angle  is  zero.   The  key  word  here  is  exactly.   For  example, 
the  word  "exactly"  would  mean  that  a  prescribed  bearing  of  30°  is 
exactly  that  right  down  to  every  decimal  place.   If  the  "exactly"  is  not 
used,  then  "30°"  may  represent  a  whole  event,  which  would  have  a  positive 
probability. For example, if we are rounding off to the nearest degree
and the density function is constant over the interval from 0° to 360°,
then "30°" would mean the event from 29.500...° to 30.499...°, which has
probability 1/360. If we are rounding off to the nearest minute, then
"30°" refers to the interval from 29°59.500...' to 30°0.499...', which
has probability 1/21,600. If we round off to the nearest second, then
"30°" is the event from 29°59'59.500..." to 30°0'0.499...", which has
probability 1/1,296,000. "Exactly 30°" is a single point, not an interval,
and  its  probability  must  be  smaller  than  the  probability  of  every 
interval  that  contains  it,  no  matter  how  small.   Thus  the  probability  of 
a  bearing  angle  of  "exactly  30°"  is  zero. 

CUMULATIVE  DISTRIBUTION  FUNCTION 

Let  X  be  a  random  variable,  x  a  real  number,  and  F(x)  the  probability 
that  X  takes  on  values  less  than  or  equal  to  x: 

F(x) = P(X ≤ x).

Then  the  function  F  is  called  the  cumulative  distribution  function 
(CDF)  of  X.   CDF  values  are  of  particular  importance  in  ASW  in  their 
application  to  search  regions.   For  example,  (in  the  continuous  random 
variable  case)  it  is  important  to  know  what  the  probability  of  contact 
on  a  target  along  the  search  length,  L,  is  up  to  and  including  the 
point  X  =  L/2,  where  exactly  half  (or  1/3  or  3/4  or  some  other  portion) 
of  the  area  has  been  searched.   This  type  of  information  is  essential  to 
allow reasonable allocation of available search effort. Further, the CDF
of the target evaluated at the x_H value corresponding to that point
half-way (in this example) through the search region represents this
desired (cumulative) probability, as seen in Figure 18, where, for purposes
of generality, x_H is taken to be the middle or center x value of the search
region's axis.


Figure 18:  CDF of Search Region, with F(x_H) = P[X ≤ x_H] the Cumulative
Probability of Detection after 1/2 of Region is Searched

For example, using the data shown in Table 6, the cumulative
distribution function F of X would be as shown in Figure 19, which consists of
summing the frequencies, f(x_i), corresponding to those values of x_i less
than or equal to x.


Figure 19:  Graph of F  (vertical axis from 0 to 36/36; jumps of 1/36,
3/36, 5/36, 7/36, 9/36 and 11/36 at x = 1, 2, ..., 6)

Observe  that  F  in  Figure  19  consists  of  a  sequence  of  horizontal  line 
segments  and  two  rays.   One  ray  coincides  with  that  part  of  the  x-axis 
to the left of x = 1 because P(X ≤ x) is 0 if x < 1. There is a jump
in the graph at x = 1 because F(x) = 0 to the left of 1 and F(1) = 1/36.
The graph has constant height F(x) = 1/36 for 1 ≤ x < 2. At x = 2
there is a jump of 3/36 (or f(2) = 3/36) bringing F(x) up to F(2) = 4/36.
As we proceed along the graph from left to right, we find a jump at each
integer x = 1, 2, 3, 4, 5, 6. For x ≥ 6, the graph has another ray
extending indefinitely to the right because F(x) = P(X ≤ x) is 1 if x is
any real number greater than or equal to 6. If we desire to find the CDF
for a number between the integer values, for example F(5/2), we can see
from Figure 19 that F(5/2) = 4/36. To compute the CDF of a continuous
random  variable,  it  is  again  necessary  to  use  calculus;  however,  its 
values  can  usually  be  obtained  from  a  standard  table. 
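
For the discrete case, the step behavior of F described above follows directly from summing the table entries. A minimal sketch using the Table 6 values (the function name `cdf` is an illustrative choice):

```python
from fractions import Fraction

# Distribution from Table 6.
f = {x: Fraction(n, 36) for x, n in zip(range(1, 7), [1, 3, 5, 7, 9, 11])}

def cdf(x):
    """F(x) = P(X <= x): sum f over all table values not exceeding x."""
    return sum((p for value, p in f.items() if value <= x), Fraction(0))

print(cdf(0.5), cdf(1), cdf(2.5), cdf(6), cdf(10))   # 0 1/36 1/9 1 1
```

The value printed for F(5/2) is 1/9, which is 4/36 in lowest terms, and the function is flat between integers because no table value lies there.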

MEAN,  VARIANCE  AND  STANDARD  DEVIATION 

The  mean,  variance  and  standard  deviation  are  also  used  to  describe 
a  random  variable.   These  will  be  shown  here  for  discrete  random  variables 
only,  as  the  values  for  continuous  random  variables  will  be  found  from 
tables. 

The mean or expectation of X, denoted by E(X) (or simply E or μ),
is defined, for discrete random variables, by

E(X) = x_1 f(x_1) + x_2 f(x_2) + ... + x_n f(x_n) = Σ x_i f(x_i),

where the sum runs over i = 1, ..., n. That is, E(X) is the weighted
average of the possible values of X, where each value is weighted by its
probability. For example, to compute the mean of X for the data of Table 6,

E(X) = Σ x_i f(x_i) = 1(1/36) + 2(3/36) + 3(5/36) + 4(7/36) + 5(9/36)
       + 6(11/36) = 161/36 = 4.47.

The mean of a random variable X measures, in a certain sense, the "average"
value of X.

The variance of X measures the "spread" or "dispersion" of X. Let X
be a random variable with the distribution shown in Table 5. Then the
variance of X, denoted by Var(X), is defined by

Var(X) = Σ (x_i - μ)^2 f(x_i) = E((X - μ)^2),

where the sum runs over i = 1, ..., n and μ is the mean of X. For example,
using the data from Table 6,

Var(X) = Σ (x_i - μ)^2 f(x_i)
       = (1 - 4.47)^2 (1/36) + (2 - 4.47)^2 (3/36) + (3 - 4.47)^2 (5/36)
         + (4 - 4.47)^2 (7/36) + (5 - 4.47)^2 (9/36) + (6 - 4.47)^2 (11/36)
       = 0.334 + 0.508 + 0.300 + 0.043 + 0.070 + 0.715 = 1.97.

The standard deviation of X, denoted by σ, is the (nonnegative) square
root of Var(X):  σ = √Var(X). For example, using the data from Table 6,
the standard deviation is

σ = √Var(X) = √1.97 = 1.4.
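
The three summary values for Table 6 can be reproduced in a few lines. This sketch keeps exact fractions until the final rounding; the variable names are illustrative.

```python
from fractions import Fraction
from math import sqrt

# Distribution from Table 6.
f = {x: Fraction(n, 36) for x, n in zip(range(1, 7), [1, 3, 5, 7, 9, 11])}

mean = sum(x * p for x, p in f.items())                 # E(X)
var = sum((x - mean) ** 2 * p for x, p in f.items())    # Var(X)
std = sqrt(var)                                         # standard deviation

print(mean, round(float(mean), 2))   # 161/36 4.47
print(round(float(var), 2))          # 1.97
print(round(std, 1))                 # 1.4
```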




CHAPTER  VI:   BASIC  STATISTICAL  DISTRIBUTIONS 

BACKGROUND 

Generally,  systems  which  arise  in  solving  many  practical  ASW  problems 
must  be  describable  in  terms  of  a  simple  mathematical  model,  and  in  most 
interesting  cases  this  mathematical  description  is  in  probabilistic 
(rather  than  deterministic)  terms.   Before  a  problem  can  be  analyzed  for 
probabilistic  solutions  the  basic  unknown  quantities  must  be  represented 
as  random  variables  (which  constitute  the  mathematical  model).   Once 
this  is  done,  the  density  function  or  the  CDF  is  determined,  and  the 
system  is  thereby  completely  described  in  probabilistic  terms.   The  only 
task  remaining  then  is  to  compute  the  probabilities  which  constitute  a 
problem solution, and this in itself is theoretically straightforward.
However,  it  should  be  noted  that  some  problems  can  be  modeled  more 
easily  than  others;  yet  an  important  advantage  of  this  method  is  the 
possibility  of  describing  many  ASW  problems  in  terms  of  a  few  "well 
known"  random  variables.   The  distributions  which  are  developed  here  are 
the  Bernoulli,  Binomial,  Chi-Square,  Uniform,  Normal,  Bivariate  Normal, 
Time-Varying  Normal,  Exponential  and  Poisson. 

BERNOULLI  DISTRIBUTION 

A  random  variable  X  with  exactly  two  possible  values,  say  1  and  0, 
where the probabilities P[X = 1] = p and P[X = 0] = 1 - p = q, is
called a Bernoulli random variable. Such a random variable is important
because it may be used to describe any random experiment with two possible
outcomes, such as target detection from a single ping of an active sonar.
Indeed,  it  is  the  Bernoulli  distribution  that  is  used  in  making  a  decision 
requiring  a  "Yes"  or  "No"  answer.   However,  for  our  purposes  Bernoulli 
random  variables  are  used  primarily  as  the  basis  for  the  Binomial 
distribution,  which  is  discussed  next. 

BINOMIAL  DISTRIBUTION 

The  Binomial  distribution  uses  the  two  alternatives  of  the  Bernoulli 

distribution,  P  X  =  1   =  p  and  P  1  X  =  0  i   =  q,  and  an  n-fold  repetition 

of  the  basic  experiment.   The  problem  now  is  to  find  the  probability  of 

exactly  i  successes  in  n  repeated  independent  trials;  for  example,  the 

probability  of  exactly  two  detections  in  six  pings  of  an  active  sonar,  or 

the  probability  of  one  detection  in  four  sweeps  of  a  radar.   The  Binomial 

random  variable  with  parameters  n  and  p  is  the  discrete  random  variable 

given  by  P  [x  =  i]   =  (  ?  )piqI1"i>  i  =  0,  1,  2,  ...  ,  n,  and  P  [  X  =  ij  =  0, 

otherwise.   (Recall  that  (?)  was  defined  in  Chapter  III  as     n- • 

i!(n  -  i)l 
In order to use the Binomial distribution to find the probability of
exactly 2 (or i) detections in 6 (or n) pings of an active sonar, the
probability of a single ping success must be known. For example, if
the probability of success is 0.6, or p = 0.6, then q = 1 - p = 0.4 and
therefore

$$P[X = 2] = \binom{n}{i} p^i q^{n-i} = \binom{6}{2} (0.6)^2 (0.4)^{6-2}
= \frac{6!}{2!\,(6 - 2)!} (0.6)^2 (0.4)^4
= \frac{6 \cdot 5 \cdot 4 \cdot 3 \cdot 2 \cdot 1}{(2 \cdot 1)(4 \cdot 3 \cdot 2 \cdot 1)} (0.36)(0.0256)
= (15)(0.36)(0.0256) = 0.138 .$$
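
The arithmetic above can be checked with a few lines of Python. This is only an illustrative sketch; the function name `binomial_pmf` is an assumption introduced here and does not come from the course material.

```python
from math import comb

def binomial_pmf(i: int, n: int, p: float) -> float:
    """P[X = i] for a Binomial random variable with parameters n and p."""
    q = 1.0 - p
    # comb(n, i) is the binomial coefficient n! / (i! (n - i)!)
    return comb(n, i) * p**i * q**(n - i)

# Exactly 2 detections in 6 pings when a single ping detects with p = 0.6
p_two_of_six = binomial_pmf(2, 6, 0.6)
print(round(p_two_of_six, 3))  # 0.138
```

Summing `binomial_pmf(i, 6, 0.6)` over i = 0, ..., 6 gives one, which is a quick check that the probabilities are set up correctly.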

The mean of a Binomial random variable with parameters n and p is $\mu = np$,
the variance is $\sigma^2 = npq$ and the standard deviation $\sigma = \sqrt{npq}$. In
the above example, $\mu = np = (6)(0.6) = 3.6$, $\sigma^2 = npq = (6)(0.6)(0.4) =
1.44$ and $\sigma = \sqrt{1.44} = 1.2$. With a probability of 0.6 and 6 pings, we
would expect 3 to 4 detections from every 6 pings on the average; for
this problem $\sigma$ indicates, with roughly 68% confidence, that the number of
detections from 6 pings is between 2.4 and 4.8 (that is, within one
standard deviation of the mean).

The following problems show how the binomial distribution can be used:

1.  A  P-3C  aircraft  has  been  loaded  with  eight  active  sonobuoys 
which  are  known  to  have  a  20%  failure  rate.   These  buoys  are  to  be 
utilized  in  a  situation  requiring  that  failed  buoys  be  replaced 
immediately  to  maintain  pattern  integrity.   As  this  requires  fast, 
coordinated  action  by  the  aircrew  to  reposition  the  aircraft  for  "delivery" 
and  to  prepare  replacement  buoys,  it  is  important  for  them  to  estimate 

the  number  of  buoys  requiring  replacement.   Determine  the  probability 
that,  out  of  four  sonobuoys  selected  for  a  pattern, 

(a)  1  sonobuoy  will  fail 

(b)  0  sonobuoys  will  fail 

(c)  2  sonobuoys  will  fail 

(d)  at  most  2  sonobuoys  will  fail. 

(Note  that  part  (d)  can  be  simplified  through  use  of  parts  (a),  (b),  (c), 
and  the  probability  complement  concept.) 

Answer: (a) $P[X = 1] = \binom{4}{1} (0.8)^3 (0.2)^1 = \frac{4!}{1!\,3!} (0.2)(0.8)^3
= (4)(0.2)(0.512) = 0.4096$

(b) 0.4096

(c) 0.1536

(d) $P[X \le 2] = 1 - P[X > 2] =
1 - \left[ \binom{4}{3} (0.2)^3 (0.8)^1 + \binom{4}{4} (0.2)^4 (0.8)^0 \right] = 0.9728$

2. If the probability of damaging a submarine with a single depth
charge from a destroyer is 0.5, what is the probability that 6 out of 10
depth charges will damage the submarine?

Answer: 0.21

3.  Compute  the  (a)  mean,  (b)  variance,  and  (c)  standard  deviation  of 
a  binomial  distribution  in  which  p  =  0.7  and  n  =  60. 

Answer:   (a)   42 

(b)  12.6 

(c)  3.55 

4. Consider a box of 200 fuses. It is known that for each fuse the
probability of being defective is 0.02. What is the probability that
at most six fuses are defective? (Set up the problem but do not carry
out the calculations, although the solution is not too difficult if one
has the use of a pocket calculator. It will be seen later that a good
approximation to this answer can be obtained much more easily by means of
the Poisson approximation.)

Answer:

$$P[X \le 6] = \sum_{i=0}^{6} \frac{n!}{i!\,(n - i)!} p^i q^{n-i}
= \sum_{i=0}^{6} \frac{200!}{i!\,(200 - i)!} (0.02)^i (0.98)^{200-i}
= 0.8914$$
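
Where a pocket calculator is replaced by a computer, the cumulative sum can be evaluated directly. The following is a minimal sketch; the name `binomial_cdf` is an assumption made for illustration.

```python
from math import comb

def binomial_cdf(k: int, n: int, p: float) -> float:
    """P[X <= k]: sum of the Binomial probabilities for i = 0, 1, ..., k."""
    q = 1.0 - p
    return sum(comb(n, i) * p**i * q**(n - i) for i in range(k + 1))

# At most six defective fuses in a box of 200, each defective with p = 0.02
p_at_most_six = binomial_cdf(6, 200, 0.02)
print(p_at_most_six)
```

Python integers have arbitrary precision, so `comb(200, i)` is exact and no special handling of the large factorials is needed.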


UNIFORM  PROBABILITY  DISTRIBUTION 

The Uniform probability distribution represents situations in which
any two sub-regions of the same size have equal probabilities, as explained
below. This distribution is shown in Figure 20, which shows that the
random variable X may assume only values between X = a and X = b, or
$a \le X \le b$, with non-zero probability (the probability of any region
outside the interval $a \le X \le b$ is zero).

Figure 20: Uniform Probability Distribution and Its CDF

Since the area under a probability distribution must equal one, the height
of the density function between a and b must be $\frac{1}{b - a}$. The mean value
of a Uniform distribution coincides with the central x-value, and is
$(b + a)/2$. The variance is $(b - a)^2/12$. The figure also shows the
cumulative distribution function for the Uniform distribution.

A Uniform distribution is described by writing "Uniform (a,b) or
U(a,b) distribution"; for example, a Uniform distribution having its
non-zero probability values in the interval 0 to 1 is denoted as "a
Uniform zero, one distribution."

An  example  application  of  the  Uniform  random  variable  is  the 
probability  distribution  of  the  location  of  a  submarine  in  a  narrow  or 
restricted  body  of  water.   If  it  is  known  that  the  submarine  will  not  be 
in  water  less  than  100  feet  deep,  then  the  probability  that  the 
submarine  will  be  in  such  regions  is  zero.   If  there  is  no  information 
to  indicate  any  preference  in  location,  it  must  be  assumed  that  the 
probability  distribution  for  the  location  of  the  submarine  is  uniform, 
as shown in Figure 21.

Figure 21: Profile of a Narrow Channel and Associated
Uniform Probability Distribution

Suppose that a = 1,500 yards and b = 20,000 yards from a reference
position (at one of the restriction boundaries, for example). The
probability that the submarine is located in any subinterval, say of
width 1,000 yards, is

$$P[c \le X \le d] = (d - c)\,\frac{1}{b - a}
= (1{,}000 \text{ yards})\,\frac{1}{18{,}500 \text{ yards}} = 0.05$$

Special  note  should  be  made  of  the  fact  that  the  dimensions  (yards,  in 
this  example)  associated  with  the  physical  problem  must  cancel  so  that 
the  resultant  probability  is  a  dimensionless  number.   The  probabilities 
of  any  two  intervals  of  the  same  length  within  the  interval  from  a  to  b 
are  equal;  that  is, 

$P[c \le X \le d] = P[e \le X \le f]$ if d - c = f - e,
where both $c \le X \le d$ and $e \le X \le f$ are within $a \le X \le b$.
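
The interval rule for the Uniform distribution can be sketched in Python as follows. The function name and the clipping of the interval to the range from a to b are illustrative assumptions, not part of the original text.

```python
def uniform_interval_prob(a: float, b: float, c: float, d: float) -> float:
    """P[c <= X <= d] for X distributed Uniform(a, b).

    Any part of the interval [c, d] lying outside [a, b] has zero
    probability, so the interval is clipped before its length is taken.
    """
    lo, hi = max(a, c), min(b, d)
    if hi <= lo:
        return 0.0
    # Units (yards here) cancel, leaving a dimensionless probability
    return (hi - lo) / (b - a)

# A 1,000-yard subinterval of the channel between 1,500 and 20,000 yards
p_subinterval = uniform_interval_prob(1500.0, 20000.0, 5000.0, 6000.0)
print(round(p_subinterval, 2))  # 0.05
```

Any other 1,000-yard subinterval inside the channel, for example 12,000 to 13,000 yards, gives the same value, which is the equal-length property stated above.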

Suppose  that  two  sensors  at  separate  locations  each  produce  a  line  of 
bearing  (LOB)  to  a  submarine  target.   Each  of  the  LOB's  has  a  bearing 
error of ten degrees (± 5°) associated with it. Suppose further that the
LOB's intersect in a plane region "centered" 117.5 n.m. from the first
sensor  and  66.5  n.m.  from  the  second.   The  information  available  to  a 
prosecuting  platform  is  that  the  location  of  the  submarine  is  Uniformly 
distributed  over  the  plane  region.   Since  a  bearing  error  of  1  degree 
represents  a  range  error  of  about  1  n.m.  (perpendicular  to  the  LOB)  at 
a  range  of  60  n.m.,  the  area  is  a  trapezoid  as  shown  by  the  "cross- 
hatched"  region  in  Figure  22. 


[Figure: Sensor #1 and Sensor #2 with their LOB's; LOB #1 is 117.5 mi. long,
and the region of intersection measures approx. 20 mi. by approx. 12 mi.]

Figure 22: Trapezoid Produced by Two Sensors

Suppose sensors #1 and #2 are separated by 135 n.m. and the LOB's are
60 degrees from sensor #1 and 330 degrees from sensor #2; then the
trapezoid approximates a rectangle with sides 20 by 12 n.m.; therefore,
the probability region has an area of about 240 n.m.². If a sonobuoy
with a 3 mile radius of detection is dropped in the probability region,

(a) what probability of detection does this sonobuoy yield?

(b) what probability of detection would six nonoverlapping
sonobuoys yield?

Answer: (a) Probability of Detection = (area of sonobuoy coverage) / (area of trapezoid)

$= \frac{\pi r^2}{240} = \frac{28.27}{240} = 0.12$

(b) Probability of Detection $= \frac{6\,(28.27)}{240} = 0.71$
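
The area-ratio argument used in both answers can be written as a short Python sketch. It assumes, as the problem does, that the detection circles do not overlap and lie wholly inside the probability region; the function name is hypothetical.

```python
from math import pi

def coverage_probability(n_buoys: int, detection_radius: float,
                         region_area: float) -> float:
    """Detection probability for a target Uniformly distributed over a region.

    Assumes the n detection circles do not overlap and lie entirely
    inside the region, so the covered area is simply n * pi * r^2.
    """
    covered_area = n_buoys * pi * detection_radius**2
    return min(covered_area / region_area, 1.0)

p_one = coverage_probability(1, 3.0, 240.0)
p_six = coverage_probability(6, 3.0, 240.0)
print(round(p_one, 2), round(p_six, 2))  # 0.12 0.71
```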


NORMAL  DISTRIBUTIONS 


THE  NORMAL  DISTRIBUTION 

The  Normal  distribution  is  probably  the  most  important  continuous 
random  variable  used  in  ASW.   It  is  used  to  describe  the  sum  of  a  large 
number  of  statistically  independent  random  variables.   The  Normal 
probability  density  function  is  defined  as: 


$$f(x) = \frac{1}{\sigma \sqrt{2\pi}}\, e^{-\frac{1}{2}(x - \mu)^2 / \sigma^2} \quad \text{for all real numbers } x,$$

where $\mu$ and $\sigma^2$ are the mean and variance. We denote the Normal
distribution with mean $\mu$ and variance $\sigma^2$ by $N(\mu, \sigma^2)$. Figure 23
shows a Normal distribution of N(0,1).

Figure 23: Normal Distribution N(0,1)

The probability that a Normally distributed random variable X is less
than or equal to b (which is the Cumulative Distribution Function) is given
by:

$$F_X(b) = P[X \le b] = \int_{-\infty}^{b} \frac{1}{\sigma \sqrt{2\pi}}\, e^{-\frac{1}{2}(x - \mu)^2 / \sigma^2}\, dx.$$

Evaluation of this integral can be circumvented by use of standard
tables. The probability that X is less than or equal to b can be
represented by the shaded area in Figure 24, and its magnitude is
determined with the aid of tables in Appendix A.


Figure 24: The Density Function of the Normal Distribution

Note that the Normal density function is dependent only on the two
parameters $\mu$ and $\sigma^2$. If $\mu$ and $\sigma^2$ are both specified, probabilities
can be obtained from the table in Appendix A. If

$$f_X(b) = \frac{1}{\sqrt{2\pi}}\, e^{-b^2/2},$$

then the random variable X has a Normal distribution with mean 0 and
variance 1, and X is called a Standard Normal random variable. Most
tables are written for the Standard Normal Distribution; therefore, to
obtain probabilities for any other mean or variance values the appropriate
conversions must be made. To compute the probability that X lies between
a and b, convert a and b into standard units:

$$a' = (a - \mu)/\sigma \quad \text{and} \quad b' = (b - \mu)/\sigma$$

respectively.

Then

$P(a \le X \le b) = P(a' \le X^* \le b')$ = area under the Standard Normal
curve between a' and b', where $X^*$ is the Standard Normal random variable.
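
The conversion to standard units can be sketched in Python, with the error function `math.erf` standing in for the table of Appendix A. The function names are assumptions chosen for illustration.

```python
from math import erf, sqrt

def std_normal_cdf(z: float) -> float:
    """P[X* <= z] for the Standard Normal random variable X*."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def normal_interval_prob(a: float, b: float, mu: float, sigma: float) -> float:
    """P[a <= X <= b] for X distributed N(mu, sigma^2)."""
    a_std = (a - mu) / sigma   # a' in the text
    b_std = (b - mu) / sigma   # b' in the text
    return std_normal_cdf(b_std) - std_normal_cdf(a_std)

# Mean 8 and standard deviation 4: probability that X lies between 5 and 10
print(round(normal_interval_prob(5.0, 10.0, 8.0, 4.0), 3))  # 0.465
```

Values computed this way may differ from table look-ups in the last digit, since the table entries are themselves rounded.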

The  following  problems  are  used  to  familiarize  the  student  with  the 
use  of  the  Standard  Normal  distribution  N(0,1)  and  Appendix  A: 

1.   Let  X  be  a  random  variable  with  the  Standard  Normal  distribution 
N(0,1). 

Find: (i) $P(0 \le X \le 1.42)$        (iv) $P(0.65 \le X \le 1.26)$
(ii) $P(-0.73 \le X \le 0)$        (v) $P(-1.79 \le X \le -0.54)$
(iii) $P(-1.37 \le X \le 2.01)$    (vi) $P(X > 1.13)$

(i) $P(0 \le X \le 1.42)$ is equal to the
area under the Standard Normal curve
between 0 and 1.42. Thus in Appendix
A, page A-1, look down the second
column until 1.42 is reached. The
entry is .4222. Hence $P(0 \le X \le 1.42)$
= .4222.
(ii) By symmetry, $P(-0.73 \le X \le 0)$ =
$P(0 \le X \le 0.73)$ = .2673
(iii) $P(-1.37 \le X \le 2.01)$ =
$P(-1.37 \le X \le 0)$ +
$P(0 \le X \le 2.01)$ =
.4147 + .4778 = .8925
(iv) $P(0.65 \le X \le 1.26)$ =
$P(0 \le X \le 1.26)$ -
$P(0 \le X \le 0.65)$ =
.3962 - .2422 = .1540
(v) $P(-1.79 \le X \le -0.54)$ =
$P(0.54 \le X \le 1.79)$ =
$P(0 \le X \le 1.79)$ -
$P(0 \le X \le 0.54)$ =
.4633 - .2054 = .2579
(vi) $P(X > 1.13)$ =
$P(X \ge 0)$ - $P(0 \le X \le 1.13)$ =
.5000 - .3708 = .1292


2.   Let  X  be  Normally  distributed  with  mean  8  and  standard  deviation 
4.   Find: 

(i) $P(5 \le X \le 10)$  (iii) $P(X \ge 15)$

(ii) $P(10 \le X \le 15)$  (iv) $P(X \le 5)$

(i) To find $P(5 \le X \le 10)$, first convert to the Standard Normal
distribution N(0,1):
$P(a \le X \le b) = P(a' \le X^* \le b')$
$a' = (a - \mu)/\sigma = (5 - 8)/4 = -.75$
$b' = (b - \mu)/\sigma = (10 - 8)/4 = .5$

Therefore $P(-.75 \le X^* \le .5) = P(-.75 \le X^* \le 0) + P(0 \le X^* \le 0.5)$ =
.2734 + .1915 = .4649
(ii) $P(10 \le X \le 15) = P(.5 \le X^* \le 1.75)$ =
$P(0 \le X^* \le 1.75) - P(0 \le X^* \le .5)$ =
.4599 - .1915 = .2684
(iii) $P(X \ge 15) = P(X^* \ge 1.75)$ =
$P(X^* \ge 0) - P(0 \le X^* \le 1.75)$ =
.5000 - .4599 = .0401
(iv) $P(X \le 5) = P(X^* \le -.75)$ =
$P(X^* \le 0) - P(-.75 \le X^* \le 0)$ =
.5000 - .2734 = .2266

3. Suppose the temperature T during June is Normally distributed
with mean 68° and standard deviation 6°. Find the probability p that
the temperature is between 70° and 80°.

70° in standard units = (70 - 68)/6 = .33
80° in standard units = (80 - 68)/6 = 2.00
Then $p = P(70 \le T \le 80) = P(0.33 \le T^* \le 2)$ =
$P(0 \le T^* \le 2) - P(0 \le T^* \le 0.33)$ =
.4772 - .1293 = .3479

Here $T^*$ is the standardized random variable corresponding to T, so
$T^*$ has the Standard Normal distribution N(0,1).

4.  Suppose  that  the  lifetime  of  a  certain  type  of  electronic  tube 
is  a  Normally  distributed  random  variable.   Suppose  further  that  brand 
A has a lifetime X which is N(30 hours, 36 hours²), while brand B has a
lifetime Y which is N(45 hours, 9 hours²). Of these two brands, which
would  you  choose  for  use  in  an  experimental  aircraft  with: 

(a)  34  hours  mission  time 

(b)  40  hours  mission  time 

(Hint:   you  should  be  able  to  solve  these  questions  by  simply  sketching 
the  two  Normal  distributions). 

BIVARIATE  NORMAL  DISTRIBUTION 

The  Bivariate  Normal  and  the  Time-Varying  Normal  distributions  are 
proposed  inclusions  to  the  computer  program  of  the  P-3  aircraft  for 
tactical,  on-station  use  of  probability  theory,  and  although  a  thorough 
development  of  the  distributions  is  considered  beyond  the  scope  of  this 
course,  the  following  cursory  description  of  each  of  these  distributions 
should  be  sufficient  for  an  intuitive  idea  of  how  these  distributions 
are used.

The Bivariate Normal distribution is the joint probability distribution
of two Gaussian (or Normal) random variables. Basically, this
distribution  is  the  three-dimensional  extension  of  the  two-dimensional 
(the  value  and  its  frequency)  Normal  distribution.   As  seen  in  Figure  25, 
the  Bivariate  Normal  distribution  resembles  a  hat,  and  the  specific 
Bivariate  Normal  distribution  shown  has  a  central  value,  which  may  or 
may  not  be  located  at  the  origin. 




Figure  25:   Bivariate  Normal  Distribution 

The  Bivariate  Normal  distribution  is  useful  in  Search  Theory,  and 
for  two-dimensional  area  coverage. 

If  the  student  is  interested  in  further  development  of  the  Bivariate 
Normal,  reference  should  be  made  to  Selected  Methods  and  Models  in 
Military  Operations  Research,  P.  W.  Zehna,  et  al,  U.  S.  G.  P.O.,  1971, 
pages  49  -  61. 

TIME-VARYING NORMAL DISTRIBUTION

The Time-Varying Normal distribution describes a Normal random variable
whose distribution varies as a function of time as well as of another
parameter such as range. Figure 26 shows an example where the mean is
zero for all values of time t, but the variance is increasing with
increasing t.

Figure 26: Time-Varying Normal Distribution

The distribution shown in Figure 26 can be used to describe a submarine
opening a reported datum, where a "datum outbound" situation is assumed
to exist. As the figure illustrates, the probability that the submarine
is in close proximity to the reported datum (as reported at time $t_0$, or 0)
decreases with increasing time. This is indicated by the spreading out
and flattening of the distribution from times $t_1$ to $t_n$.
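
The flattening effect can be illustrated numerically with a short Python sketch. The linear spreading law in `sigma_at`, its parameter values, and the 5-mile radius are purely hypothetical assumptions chosen for illustration; the text does not specify how the variance grows with time.

```python
from math import erf, sqrt

def prob_within(distance: float, sigma: float) -> float:
    """P[|X| <= distance] for X distributed N(0, sigma^2)."""
    return erf(distance / (sigma * sqrt(2.0)))

def sigma_at(t: float, sigma0: float = 2.0, growth: float = 1.5) -> float:
    """Hypothetical spreading law: standard deviation grows linearly in t."""
    return sigma0 + growth * t

# Probability the submarine is still within 5 miles of datum as time passes
for t in (0, 1, 2, 4):
    print(t, round(prob_within(5.0, sigma_at(t)), 3))
```

The printed probabilities decrease as t increases, which is the numerical counterpart of the spreading and flattening described above.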

It is important to understand the development thus far presented to
provide correct insight into what this distribution represents. Further,
it is a definite advantage (and, in fact, a necessity for tactical,
real-time usage) to have a digital computer to perform the mathematics
associated with the Time-Varying Normal distribution resulting from an
operator-input assessment of the tactical situation. The computer could
then output the pertinent probabilities and options for operator use. Again,
if the operator is to utilize the computer to employ the distribution,
then it is imperative that he understand the tactical assumptions it is
based on.

CHI-SQUARE DISTRIBUTION

The  Chi-Square  random  variable  is  useful  for  ASW  applications  in  that 
it  can  be  used  to  describe  any  system  that  is  the  sum  of  the  squares  of 
Normal  independent  random  variables.   Probably  the  most  important  of 
these  applications  is  that  of  describing  radial  distance,  where  the 
distance  is  in  terms  of  two  or  three  directions,  each  of  which  has 
distance  distributed  Normally.   Therefore,  the  radial  distance  to  a 
specific  point  (from  the  origin)  is  the  square  root  of  the  sum  of  the 
squares  of  the  cartesian  coordinates  of  the  point  (by  the  Pythagorean 
theorem) . 

An  example  which  displays  the  utility  of  the  Chi-Square  distribution 
is  to  obtain  the  probability  that  a  submarine  is  within  5  miles  of  a 
central  datum  (the  origin,  for  convenience),  where  it  is  known  that  the 
probabilities  of  the  submarine  being  at  a  distance  in  miles  along  each  of 
the x and y axes are each distributed N(0,100) due to error in the system
that  initially  located  the  submarine.   The  submarine's  position  can  be 
in  error  (or  vary  from  the  stated  datum  position)  in  both  the  x  and  y 
directions;  therefore,  its  position  may  be  specified  by  two  random 
variables,  X  and  Y.   Although  there  are  two  random  variables  involved, 
the  effect  of  using  the  Chi-Square  distribution  is  a  reduction  of  the 
problem  to  a  single  random  variable  R,  which  represents  radial  distance 
from  the  submarine  to  the  central  datum: 

$$R = \sqrt{X^2 + Y^2} .$$

The problem is to find the probability of the shaded region shown in
Figure 27.

Figure 27: Sum of Squares Less Than or Equal to 25 Miles

That is, to calculate $P[(X^2 + Y^2)^{1/2} \le 5]$, or equivalently,
$P[X^2 + Y^2 \le 25]$, we apply the following principle from probability
theory (refer to Central Limit Theorem on page 142 of Schaum's Outline
Series, Theory and Problems of Statistics): The sum of the squares of n
independent Standard Normal random variables forms a new random variable
known as the Chi-Square random variable with "n degrees of freedom"; this is
denoted by

$$\chi^2_n = \sum_{i=1}^{n} X_i^2 .$$

In the specific two-dimensional case treated here, if we put

$$X_1 = \frac{1}{10} X \quad \text{and} \quad X_2 = \frac{1}{10} Y, \quad \text{then} \quad \sum_{i=1}^{2} X_i^2 = \chi^2_2 .$$

The probability is now denoted by

$$P[X^2 + Y^2 \le 25] = P[10^2 X_1^2 + 10^2 X_2^2 \le 25]
= P\left[X_1^2 + X_2^2 \le \frac{25}{10^2}\right]
= P[X_1^2 + X_2^2 \le 0.25]
= P[\chi^2_2 \le 0.25] = F_{\chi^2_2}(0.25).$$

The last term is the probability that the Chi-Square random variable
with two degrees of freedom is less than or equal to 0.25. This
cumulative value is found from the "Percentage Points of the Chi-Square
Distribution" in Appendix A. This value, $F_{\chi^2_2}(0.25)$, is found by
entering the table with n (the left-hand column) value of 2 (the number of
degrees of freedom; $\nu$ is used for n in the table), and reading directly
to the right until the column is reached that has the tabulated value
closest to 0.25. That column is then followed vertically upward until
the top row is reached; the row is labeled Q. The value in this row
and column is the complement of the desired probability, and it is

$$Q_{\chi^2_2}(0.25) = 0.90 ,$$

$$F_{\chi^2_2}(0.25) = 1 - Q_{\chi^2_2}(0.25) = 1 - 0.90; \text{ therefore } F_{\chi^2_2}(0.25) = 0.10.$$
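
For two degrees of freedom the Chi-Square cumulative distribution has the simple closed form $F(x) = 1 - e^{-x/2}$, so the table look-up can be cross-checked without interpolation. The Python sketch below uses hypothetical function names; it is offered as a check on the table method, not as the procedure used in the course.

```python
from math import exp

def chi2_cdf_2dof(x: float) -> float:
    """P[chi-square with 2 degrees of freedom <= x], in closed form."""
    return 1.0 - exp(-x / 2.0) if x > 0 else 0.0

def prob_within_radius(r: float, sigma: float) -> float:
    """P[sqrt(X^2 + Y^2) <= r] when X and Y are independent N(0, sigma^2)."""
    # Standardizing X and Y turns the radius condition into chi-square <= (r/sigma)^2
    return chi2_cdf_2dof((r / sigma) ** 2)

# Submarine within 5 miles of datum when each axis error is N(0, 100)
print(round(prob_within_radius(5.0, 10.0), 3))  # 0.118
```

The closed form gives about 0.12, while the table method above returns 0.10 because it takes the nearest tabulated column rather than interpolating between columns.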

Now, consider the problem of finding the probability that a
submarine will be "killed" by a moored mine if it is known that the kill
capability of the mine is a function of the radial distance between the
mine and the submarine. This is a three dimensional problem, as the
submarine depth is also considered. The kill function is known to be
certain if the submarine is within 30 yards of the mine, and the
probabilities that the submarine will pass within a given range in each
of the three coordinate directions (depth, "North-South," and "East-West")
are independently distributed N(0,625) in yards. The problem is to
calculate $P[X^2 + Y^2 + Z^2 \le 900]$.

Let

$$X_1 = \frac{X}{\sigma_X} = \frac{X}{25}, \quad
X_2 = \frac{Y}{\sigma_Y} = \frac{Y}{25}, \quad
X_3 = \frac{Z}{\sigma_Z} = \frac{Z}{25}$$

(This technique is that used to normalize and standardize a Normal random
variable), then

$$\sum_{i=1}^{3} X_i^2 = \chi^2_3 .$$

So that,

$$P[(X^2 + Y^2 + Z^2)^{1/2} \le 30] = P[25^2 (X_1^2 + X_2^2 + X_3^2) \le 900]
= P\left[X_1^2 + X_2^2 + X_3^2 \le \frac{900}{25^2}\right]
= P[X_1^2 + X_2^2 + X_3^2 \le 1.44]
= F_{\chi^2_3}(1.44) .$$

From the table on page A-3,

$$Q_{\chi^2_3}(1.44) = 0.75, \text{ and } F_{\chi^2_3}(1.44) = 0.25 .$$

Hence, under the assumed probability distributions, the probability that
the submarine will be "killed" is 0.25.
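
With three degrees of freedom the Chi-Square cumulative distribution can also be written in closed form using the error function, $F(x) = \mathrm{erf}(\sqrt{x/2}) - \sqrt{2x/\pi}\, e^{-x/2}$. The sketch below is an illustrative cross-check on the nearest-column table reading; the function name is an assumption.

```python
from math import erf, exp, pi, sqrt

def chi2_cdf_3dof(x: float) -> float:
    """P[chi-square with 3 degrees of freedom <= x], in closed form."""
    if x <= 0:
        return 0.0
    r = sqrt(x)  # radial distance in standard-deviation units
    return erf(r / sqrt(2.0)) - sqrt(2.0 / pi) * r * exp(-x / 2.0)

# Mine problem: kill radius 30 yards, each coordinate error N(0, 625)
kill_probability = chi2_cdf_3dof((30.0 / 25.0) ** 2)
print(kill_probability)
```

The closed form evaluates to roughly 0.30 at 1.44. The table method reports 0.25 because 1.44 falls between the tabulated columns for Q = 0.75 and Q = 0.50 and the nearer column is used.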

There  are  other  methods  of  solving  this  problem  which  involve  working 
with  the  random  variables  X  and  Y;  however,  the  Chi-Square  method  is 
easier,  and  it  is  convenient  because  the  probabilities  associated  with 
it  can  be  simply  obtained  from  a  precomputed  table. 

The  following  problems  will  help  familiarize  the  student  with  the 

use  of  the  Chi-Square  distribution  and  Appendix  A: 

1. Find $P[\chi^2_{20} \le 21.5] = F_{\chi^2_{20}}(21.5)$


Answer:   0.75 


2.   Find  P  [  X22Q-     1^.7  ] 


Answer:   0.25 


3. Find the value of x for which $P[\chi^2_{25} \le x] = 0.75$


Answer: 29.3

4. An antiaircraft shell designed to burst at a specified point in
space has x, y, and z errors in a coordinate system with origin at the
specified point which are independent Normally distributed random
variables X, Y, Z, each with mean 0 and standard deviation 50, with
length measured in feet. Find the probability that the radial error in
the burst point is at most 100 feet.

Answer: 0.75

EXPONENTIAL  DISTRIBUTION 


An exponential function is a function of the form $f(t) = e^t$, where
e is a constant which is very much like $\pi$, except that e is
approximately 2.7183, vice 3.1416. For example, if t = 2, then
$f(2) = e^2 = (2.7183)^2 = 7.3891$, as seen from the last entry opposite
x = 2.00 in the table of Appendix A, page A-2. The specific exponential
function of interest here is the negative exponential, $f(t) = e^{-t}$ (see
Appendix A, page A-2 for values of $e^{-t}$), which is the reciprocal of the
positive exponential shown before, or $f(t) = 1/e^t$.

The Exponential distribution itself is actually

$$f_X(t) = \lambda e^{-\lambda t}, \quad t \ge 0,$$

where $\lambda$, the lower case Greek letter "lambda," is a positive
constant. It turns out that $1/\lambda$ is the mean of this distribution. The
graph of the Exponential distribution is shown in Figure 28. It is seen
in the figure that the function (the Exponential distribution) "decays"
with the progression of time.

Figure 28: The Exponential Distribution

The Exponential distribution can be used to describe many phenomena
connected with military applications. An example where such a function
occurs is the time to failure of a piece of electronic equipment, whose
mean is the MTTF (Mean Time To Failure). The time axis represents the
total lifetime of the equipment itself. The random variable X represents
the total lifetime; that is, to say that X assumes a value $t_0$ means
that the equipment fails at time $t_0$.

The cumulative distribution function is

$$F_X(t) = 1 - e^{-\lambda t}, \quad t \ge 0 .$$

Therefore, the table in Appendix A, page A-2, is still usable, except
that the value found in the table must be subtracted from one. Figure 29
shows the Cumulative Exponential probability distribution.


Figure  29:   The  Cumulative  Exponential  Distribution  Function 

Practical  ASW  applications  of  the  Exponential  probability  distribution 
are  predicting:   (1)  the  mean  time  to  trail  a  submarine  and  (2)  the  mean 
time  to  fail  for  ASW  equipment. 

The  following  problems  show  some  of  these  applications. 

1. Suppose that it is desired to trail a submarine that transits a
barrier. Take the time that the submarine enters the barrier to be
t = 0, and let $P_0$ be the probability that trail is initiated at t = 0,
where $P_0$ is known. In this problem $\lambda$ is the reciprocal of the mean
time to lose trail (or the average rate of losing trail). For this
problem let $1/\lambda$ be 20 days, and let $P_0 = 0.5$. What is the probability
the submarine is trailed at time t = 10 days? Since $1/\lambda = 20$,
$\lambda = 1/20 = 0.05$. The probability that a trail initiated at t = 0 has
not been lost by time t is $e^{-\lambda t}$; therefore,

$$P(t) = P_0\, e^{-\lambda t} = 0.5\, e^{-(0.05)(10)} = (0.5)(0.61) = 0.3$$

2. A device is to operate until failure. The pdf in a model describing
the time of failure is $f(t) = e^{-t}$, $0 \le t$, where t is in hours. What
is the probability the device will fail in the first two hours?

Answer: In order that $e^{-t}$ be of the form $\lambda e^{-\lambda t}$, $\lambda$ must equal
one, that is, the cdf is $F(t) = 1 - e^{-t}$. Thus P[Fail in first 2 hrs] =
$1 - e^{-2}$ = 1 - 0.135 = 0.865
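
The Exponential cumulative distribution and its complement are simple to compute directly rather than from the table on page A-2. The following is a minimal sketch with illustrative function names.

```python
from math import exp

def exponential_cdf(t: float, lam: float) -> float:
    """P[X <= t]: probability of failure by time t, with failure rate lam = 1/mean."""
    return 1.0 - exp(-lam * t) if t >= 0 else 0.0

def exponential_survival(t: float, lam: float) -> float:
    """P[X > t]: probability the equipment is still working at time t."""
    return 1.0 - exponential_cdf(t, lam)

# Device with pdf e^(-t), t in hours: probability of failure in the first two hours
print(round(exponential_cdf(2.0, 1.0), 3))  # 0.865
```

The same two functions cover both kinds of question that arise with this distribution: "fails by time t" uses the cumulative distribution, and "lasts at least until time t" uses its complement.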

3. A resistor is installed in a fire control system. The pdf
describing the time to failure of the resistor is $f(t) = \lambda e^{-\lambda t}$, $0 \le t$,
where t is in years. If the MTTF is 15 years what is the probability the
resistor will last 6 years?

$$\lambda = \frac{1}{\text{MTTF}} = \frac{1}{15} = 0.067$$

P[Resistor lasts 6 years] = 1 - P[Resistor fails in 6 years] =
$1 - [1 - e^{-\lambda t}] = e^{-(1/15)(6)} = e^{-0.4} = 0.67$

4.  The  reliability  of  a  device  during  a  mission  depends  on  T,  the 
time  the  device  has  been  in  storage  before  the  start  of  the  operation. 
In a probability model, A is the event the device survives the mission, and
$f(t) = e^{-t}$ is the pdf of T. Find P(A) for a 6 month mission, where T is
in years.

$F(t) = 1 - e^{-t}$, P(A) = 0.6

POISSON DISTRIBUTION

Many  physical  phenomena  can  be  described  by  the  Poisson  distribution. 
Emission  of  particles  from  a  radiating  source,  arrivals  of  messages  in  a 
CIC,  and  submarines  transiting  a  barrier  are  a  few  of  the  shipboard  ASW 
situations  that  may  require  this  distribution  for  analysis  of  the 
problem. 

The  basic  notion  of  this  distribution  involves  a  stream  of  "customers" 
arriving  at  a  "service  point"  at  a  constant  average  arrival  rate.   It  is 
assumed  also  that  arrivals  are  independent  in  the  sense  that  the  number 
of  arrivals  in  any  given  time  interval  is  independent  of  the  number  of 
arrivals  in  any  other  interval  which  does  not  overlap  the  first. 

The  Poisson  distribution  is  given  by 

$$p(k, \lambda) = \frac{\lambda^k e^{-\lambda}}{k!}, \quad k = 0, 1, 2, \ldots$$

where $\lambda > 0$ is some constant.

In the applications of the Poisson distribution that are of primary
interest in ASW, the Poisson distribution represents the number of
"arrivals" in the given time interval. For example, if time is
measured in hours, the given time interval is a certain ten-hour period,
and $\lambda = 0.5$, then the notation

$$p(2, 0.5) = \frac{(0.5)^2 e^{-0.5}}{2!} = 0.076$$

is interpreted to mean:

"the probability of exactly two arrivals in the given time period is
0.076."
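
A direct evaluation of the Poisson formula is sketched below in Python; the function name `poisson_pmf` is an assumption made for illustration.

```python
from math import exp, factorial

def poisson_pmf(k: int, lam: float) -> float:
    """p(k, lam): probability of exactly k arrivals when the mean number is lam."""
    return lam**k * exp(-lam) / factorial(k)

# Exactly two arrivals in the given period when lam = 0.5
print(round(poisson_pmf(2, 0.5), 3))  # 0.076
```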

The length of the time interval is not apparent in the expression
defining $p(k, \lambda)$. Suppose the basic unit of time (this may be a second,
a minute, an hour, a day, a year, etc.) has been chosen. Let T be the
length of the given time interval, i.e. the number of basic time units.
Then the $\lambda$ in the Poisson distribution has the property that $\lambda/T$ is
the mean arrival rate, expressed as the number of arrivals per unit of
time. That is, if we denote the mean arrival rate by $\alpha$, then

$$\alpha = \lambda/T, \quad \text{or} \quad \lambda = \alpha T.$$

Therefore, for applications other than the Binomial approximation
indicated next, we use

$$p(k, \alpha T) = \frac{e^{-\alpha T} (\alpha T)^k}{k!}, \quad k = 0, 1, 2, \ldots$$

to express the probabilities for the various possible number of arrivals.

This distribution has a mean, $\mu = \lambda$, variance, $\sigma^2 = \lambda$, and
standard deviation, $\sigma = \sqrt{\lambda}$.

In  addition  to  having  its  own  applications,  the  Poisson  distribution 
also  provides  us  with  a  close  approximation  of  the  Binomial  distribution 
for  small  k,  provided  that  p  is  small  and   A  =  np.   This  is  indicated 
in  Table  7  where  n  =  100,  p  =  1/100  and  A  =  np  =  1. 


k           0        1        2        3         4         5

Binomial    0.366    0.370    0.185    0.0610    0.0149    0.0029

Poisson     0.368    0.368    0.184    0.0613    0.0153    0.00307

Table 7: Comparison of Binomial and Poisson Values
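
The comparison in Table 7 can be regenerated with a short Python sketch, which also makes it easy to try other values of n and p. The variable and function names here are illustrative assumptions.

```python
from math import comb, exp, factorial

def binomial_pmf(k: int, n: int, p: float) -> float:
    """Exact Binomial probability of k successes in n trials."""
    return comb(n, k) * p**k * (1.0 - p) ** (n - k)

def poisson_pmf(k: int, lam: float) -> float:
    """Poisson probability of k arrivals with mean lam."""
    return lam**k * exp(-lam) / factorial(k)

n, p = 100, 1.0 / 100.0
lam = n * p  # the Poisson parameter matching the Binomial mean

# One (k, binomial, poisson) row for each k = 0, ..., 5
rows = [(k, binomial_pmf(k, n, p), poisson_pmf(k, lam)) for k in range(6)]
for k, b, po in rows:
    print(f"{k}  {b:.4f}  {po:.4f}")
```

Repeating the run with a larger p (for example n = 10, p = 0.5) shows the two columns drifting apart, which is why the approximation is recommended only when p is small.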

The  Poisson  distribution  can  probably  best  be  understood  through 
examples and problems.

1. Find: (i) $e^{-1.3}$, (ii) $e^{-2.5}$.
From Appendix A, page A-2,

(i) $e^{-1.3} = 0.273$
(ii) $e^{-2.5} = 0.0821$

2. For the Poisson distribution $p(k, \lambda) = \frac{\lambda^k e^{-\lambda}}{k!}$ find

(i) p(2, 1),      (ii) p(3, 1/2),      (iii) p(2, 0.7)

(i) $p(2, 1) = \frac{1^2 e^{-1}}{2!} = \frac{e^{-1}}{2} = \frac{0.368}{2} = 0.184$

(ii) p(3, 1/2) = 0.013
(iii) p(2, 0.7) = 0.12

3. Suppose that in a particular region submarine detections
obey a Poisson distribution, with a mean detection rate of one submarine
every 12 days. Calculate the probability that five submarines will be
detected in the operating area during a quarter (120 days).

As indicated above, $\lambda = \alpha T$, where $\alpha$ represents the average
detection rate (i.e. the average number of detections per day, which is
1/12), and T the length of the given time interval, so that T = 120.
Therefore,

$$P[K = 5] = \frac{e^{-\alpha T} (\alpha T)^k}{k!}
= \frac{e^{-(1/12)(120)}\, [(1/12)(120)]^5}{5!} = 0.03783$$

4. Using the information from problem 3, find the probability of
detecting five or fewer (at most five) submarines during the quarter.

$$P[K \le 5] = \sum_{k=0}^{5} \frac{[(1/12)(120)]^k\, e^{-(1/12)(120)}}{k!} = 0.06709$$
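
Both the single-value and the cumulative Poisson probabilities for a given rate and time interval can be computed as in the sketch below. The function names and the two-argument (rate, duration) form are assumptions made for illustration.

```python
from math import exp, factorial

def poisson_pmf(k: int, rate: float, duration: float) -> float:
    """P[K = k] when arrivals occur at a mean rate over an interval of given length."""
    lam = rate * duration  # mean number of arrivals in the interval
    return lam**k * exp(-lam) / factorial(k)

def poisson_cdf(k_max: int, rate: float, duration: float) -> float:
    """P[K <= k_max]: sum of the Poisson probabilities for k = 0, ..., k_max."""
    return sum(poisson_pmf(k, rate, duration) for k in range(k_max + 1))

# One detection per 12 days on average, over a 120-day period
p_exactly_five = poisson_pmf(5, 1.0 / 12.0, 120.0)
p_at_most_five = poisson_cdf(5, 1.0 / 12.0, 120.0)
print(p_exactly_five, p_at_most_five)
```

Since the mean number of detections in the period is 10, both probabilities are small: five or fewer detections is well below what would be expected on average.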

5.  Approximate  the  Binomial  distribution  problem  number  4  in  the 
Binomial  distribution  section  of  this  chapter,  using  the  Poisson 
distribution. Answer: 0.9

APPENDIX A


Areas Under the Standard Normal Curve from 0 to

                                                                  69.3344   77.5766
 80   51.1720   53.5400   57.1532   60.3915   64.2778   71.1445   79.3343   83.130?
 90   59.1963   61.7541   65.6466   69.1260   73.2912   80.6247   89.3342   98.6499
100   67.3276   70.0648   74.2219   77.9295   82.3581   90.1332   99.3341   109.141

      -2.5758   -2.3263   -1.9600   -1.6449   -1.2816   -0.6745    0.0000    0.6745

$$Q(\chi^2 \mid \nu) = \left[ 2^{\nu/2}\, \Gamma(\nu/2) \right]^{-1} \int_{\chi^2}^{\infty} e^{-t/2}\, t^{\nu/2 - 1}\, dt$$